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Preface 


Coronaviruses were recognized as a group of enveloped, RNA viruses in 1968 
and accepted by the International Committee on the Taxonomy of Viruses as a 
separate family, the Coronaviridae, in 1975. By 1978, it had become evident that 
the coronavirus genomic RNA was infectious {i.e., positive strand], and by 1983, 
at least the framework of the coronavirus replication strategy had been per- 
ceived. Subsequently, with the application of recombinant DNA techniques, 
there have been remarkable advances in our understanding of the molecular 
biology of coronaviruses, and a mass of structural data concerning coronavirus 
genomes, mRNAs, and proteins now exists. More recently, attention has been 
focused on the role of essential and accessory gene products in the coronavirus 
replication cycle and a molecular analysis of the structure- function relation- 
ships of coronavirus proteins. Nevertheless, there are still large gaps in our 
knowledge, for instance, in areas such as the genesis of coronavirus subgenomic 
mRNAs or the function of the coronavirus RNA-dependent RNA polymerase. 

The diseases caused by coronaviruses have been known for much longer 
than the agents themselves. Possibly the first coronavirus-related disease to be 
recorded was feline infectious peritonitis, as early as 1912. The diseases associ- 
ated with infectious bronchitis virus, transmissible gastroenteritis virus, and 
murine hepatitis virus were all well known before 1950. However, it was only 
the realization in the late 1960s that a coronavirus was responsible for a human 
disease, the common cold, that brought these viruses to the attention of aca- 
demic virologists. The advances of the last few years, for example, in the 
molecular analysis of coronavirus protein immunogenicity or the identification 
of coronavirus receptors, have undoubtedly given an impetus to studies on 
coronavirus pathogenesis. At the same time, however, they have also begun to 
reveal the complexities of the coronavirus-host relationship. 

In parallel with the studies on coronaviruses, recent insights into the 
genome organization, replication strategy, and nucleotide sequences of two 
other groups of viruses, the toroviruses and the arteriviruses, have led to the 
idea of a "coronaviruslike" superfamily. This idea, which is based essentially on 
a molecular view of evolution, led, in 1992, to the inclusion of the toroviruses as 
a second genus in the Coronaviridae. The question of how the evolutionary 
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links between the Coronaviridae and the arteriviruses should be reflected in 
their taxonomic status is still open to discussion. 

This volume consists essentially of two parts. The first 12 chapters discuss 
the molecular aspects of coronavirus and torovirus biology. First, the unique 
features of coronavirus replication, including the genome organization, tran- 
scription, and translation strategies, and the importance of RNA recombination 
are reviewed. This is followed by a chapter on coronavirus receptors. Then each 
of the virus structural and nonstructural proteins is described in detail. Finally, 
the molecular biology of toroviruses and the idea of a "coronaviruslike" super- 
family is discussed. In the second part of the volume, which consists of seven 
chapters, selected biological aspects of the most important coronavirus and 
torovirus infections are covered. These range from the experimental study of 
murine coronavirus infections to the problems of coronavirus infection in 
poultry and swine. 

I hope that this volume will be useful to both academic researchers and 
their students, as well as clinicians and veterinarians with an interest in 
coronavirus-related diseases. I would like to thank the contributors, the staff of 
Plenum Press, and my wife for their patience during the preparation of this 
volume. 


Stuart G. Siddell 
University of Würzburg 
Würzburg, Germany 
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CHAPTER 1 


The Coronaviridae 


An Introduction 


STUART G. SIDDELL 


I. INTRODUCTION 


The Coronaviridae are a group of enveloped, positive-strand RNA viruses with 
nonsegmented genomes of about 30,000 nucleotides. The family comprises two 
genera, coronavirus and torovirus, which share similarities in the organization 
and expression of their genomes and the structure of the viral gene products. 
Viruses in the two genera have, however, only limited sequence similarity and 
display; for example, distinctly different nucleocapsid morphologies. 

In this introductory chapter, I shall briefly discuss the classification of the 
Coronaviridae, the organization of corona- and torovirus genomes, and the 
morphology and physicochemicai properties of the virus particles. The molecu- 
lar aspects of coronavirus and torovirus biology are discussed extensively in 
Chapters 2 to 12, and Chapters 13 to 19 deal with the biological aspects of 
coronavirus and torovirus infections. 


II. CLASSIFICATION 

Viruses that are classified in the Coronaviridae fulfill the following cri- 
teria: 

* A positive-strand, nonsegmented genome of about 30,000 nucleotides. 


The genome contains an unusually large RNA polymerase gene (about 
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20,000 nucleotides}, a large surface glycoprotein gene, an integral mem- 
brane protein gene, and a nucleocapsid protein gene. These genes are 
arranged on the genome in a specific order (as described above in the 5’ > 
3’ direction) but may be interspersed with additional genes encoding 
further structural or nonstructural proteins. 

e A virion envelope bearing pronounced surface projections. These projec- 
tions are composed of the large surface glycoprotein (M, = -200 kDa} that 
characteristically exhibits a coiled-coil structure in the carboxy-terminal, 
membrane-anchoring half. 

e An integral membrane protein (M, - -25 kDa). This protein charac- 
teristically has three membrane-spanning regions in the amino-terminal 
half. 

e A 3’ coterminal set of four or more intracellular subgenomic mRNAs. 
Only the open reading frames (ORFs) contained within the 5’ unique 
region of each mRNA (i.e., the region not found in the next smallest 
mRNA) are expressed as protein. 

e An RNA polymerase gene composed of two overlapping ORFs. Expression 
of the downstream ORF is mediated by (—1) ribosomal frameshifting. 


The features that distinguish coronaviruses and toroviruses are: 


* The size of the nucleocapsid protein (M, - -60 kDa and -18 kDa, respec- 
tively) and the shape of the helical nucleocapsid structure (extended or 
tubular, respectively). 

e The absence of a leader sequence at the 5' end of torovirus mRNAs. 

e With some limited exceptions, a lack of sequence similarity in the corona- 
virus and torovirus gene products. 


At the present time 13 viruses are classified as species in the Coronaviridae 
(Cavanagh et al., 1994). These are listed with their acronyms and natural hosts 
in Table I. 

The prototype of the coronavirus genus is avian infectious bronchitis virus. 
The name “coronavirus” is derived from the solar corona-like (L. corona = 
crown) appearance of virus particles in negatively stained electron micrographs. 
The prototype of the torovirus genus is Berne virus (BEV) and the name 
“torovirus” is derived from the curved tubular (L. torus = lowest convex mold- 
ing in the base of a column) morphology of the nucleocapsid structure. 

In addition to the viruses listed in Table I, two viruses, rat coronavirus 
(RCV) and rabbit coronavirus (RbCV}), are considered as tentative species of the 
coronavirus genus. Three more viruses, feline enteric coronavirus (FECV}], 
sialoacryoadenitis virus (SADV|, and porcine respiratory coronavirus (PRCV|, 
are often discussed in the literature as coronavirus species, but the available 
data suggest that they could be equally well considered as variants of feline 
infectious peritonitis virus (FIPV), RCV, and transmissible gastroenteritis virus 
(TGEV), respectively. 

In relation to the torovirus genus, there are reports of an enveloped virus in 
the stools of patients with gastroenteritis that resembles Breda viruses (Beards 
et al., 1984). However, this virus is not yet recognized as a member of the genus. 
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TABLE I. Coronaviridae 


Natural host Virus Acronym 
Coronavirus 
Chicken Avian infectious bronchitis virus IBV 
Cattle Bovine coronavirus BCV 
Dog Canine coronavirus CCV 
Man Human coronavirus 229E HCV 229E 
Man Human coronavirus OC43 HCV OC43 
Cat Feline infectious peritonitis virus FIPV 
Mouse Murine hepatitis virus MHV 
Pig Porcine epidemic diarrhea virus PEDV 
Pig Porcine hemagglutinating encephalomyelitis virus HEV 
Pig Porcine transmissible gastroenteritis virus TGEV 
Turkey Turkey coronavirus TCV 
Torovirus 
Horse Berne virus BEV 
Cattle Breda virus BRV 


A. Serological Relationships 


A variety of serological assays have been used to determine the antigenic 


relationships among coronaviruses. They have included neutralization, hemag- 
glutination-inhibition, immunofluorescence, immunoblotting, enzyme-linked 
immunosorbent assay (ELISA), and radioimmunoprecipitation using poly- 
valent and monovalent antisera and monoclonal antibodies (Spaan et al., 1990; 
Dea et al., 1990]. The results of these studies can be summarized by a taxonomy 
placing the 11 coronavirus species in three antigenic groups. These groups are 
shown in Table II. 

The inclusion of human coronavirus (HCV) 229E in antigenic group 1 has 
been questioned, essentially because no cross-reactivity could be demonstrated 
between HCV 229E and a large panel of TGEV-specific monoclonal antibodies 
(Sanchez et al., 1990). However, the sequence relationships of coronavirus 
proteins appear to support the inclusion of HCV 229E in this group. 


TABLE II. Antigenic 
Relationships of Coronaviruses 


Group 1 Group 2 Group 3 
HCV 229E HCV OC43 IBV 
TGEV MHV 

PEDV BCV 

CCV HEV 

FIPV TCV 
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PRCV (86/137004) 
PRCV (RM4) 
TGEV (FS772/70) 
TGEV (Purdue) 
CCV (K378) 
CCV (Insavc/1) 
FIPV (79/1146) 
HCV (229E) 
PEDV (CVv777) 


IBV (Beaudette) 
IBV (KB8523) 
IBV (Gray) 


MHV (3) 
MHV (S) 
MHV (A59) 
MHV (1) 

MHV (JHM) 
RCV (681) 
BCV (Mebus) 
TCV (Minnesota) 


BCV (F15) 
BEV (P 138/72) HCV (0C43) 
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TGEV (FS772/70) 
TGEV (NEB72/RT) 
PRCV (TOYS56/165) 
TGEV (Purdue) 
TGEV (Miller) 
PRCV (RM4) 
PRCV (86/137004) 
CCV (Insave!1) 
FIPV (79/1146) 
HCV (229E) 
PEDV (CV777) 


IBV (Beaudette) 
IBV (M42S) 
IBV (M41) 

IBV (KB8523) 

IBV (M42H) 

IBV (D274) 

IBV (6/82) 


BCV (F15) 
BCV (L9) 
BCV (Mebus) 
BCV (Quebec) 
BCV (LY138) 
BCV (Vaccine) 
HCV (0C43) 
MHV (JHM) 
MHV (CP) 
MHV (4) 
MHV (A59) 
BEV (P 138/72) 
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FIGURE 1. Phylogenetic relationships of the (A) nucleocapsid protein, (B) integral membrane 
protein, and (C) large-surface glycoprotein of coronaviruses. Amino acid sequences were aligned 
using the "clustal" method (Higgins and Sharp, 1989) and phylogenetic trees were constructed 
using the neighborhood-joining method (Saitou and Nei, 1987]. The analyses were done using the 
MegAlign module of the Lasergene software suite (DNASTAR Ltd). The phylogenies are rooted 
assuming a biological clock. 


B. Sequence Relationships 


The sequence analysis of a number of coronavirus genes, including the 
complete genomes of avian infectious bronchitis virus (IBV), murine hepatitis 
virus (MHV}, and HCV 229E, provides a database for the analysis of possible 
evolutionary relationships between coronaviruses. This type of analysis is 
shown for the coronavirus large-surface glycoprotein, the integral membrane 
protein, and the nucleocapsid protein in Fig. 1. By and large, the nucleotide 
sequence data provide a striking confirmation of the conclusions derived from 
serological analysis. A closely related genetic cluster composed of HCV OC43, 
MHYV, bovine coronavirus (BCV), and turkey coronavirus (TCV) is evident in the 
cladograms derived for all three proteins. The viruses comprising antigenic 
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group 1 also form a genetic cluster. However, these viruses are clearly less 
closely related, and an evolutionary divergence of, in particular, the human 
coronavirus HCV 229E and the porcine coronavirus porcine epidemic diarrhea 
virus (PEDV) from the remainder of the group is suggested by the data. It should 
be remembered that the analyses presented here do not account for the possible 
role of recombination in the evolution of coronavirus genes (see Chapters 2 
and 12). 

As evidenced by immunofluorescence, seroneutralization, ELISA, and 
radioimmunoprecipitation, Berne virus and Breda virus are antigenically re- 
lated to each other but there is no cross-reactivity with antisera specific for 
other animal viruses, including coronaviruses. The available data suggest that 
the cross-reactive torovirus antigen is predominantly the surface glycoprotein 
(Weiss and Horzinek, 1987). The evidence for antigenic cross-reactivity with 
human toroviruses is based mainly on immunoelectron microscopy (Beards et 
al., 1986). At the present time, there is insufficient sequence data to evaluate 
the phylogeny of toroviruses. Limited but convincing sequence similarities in 
some of the gene products of toroviruses and coronaviruses (see Chapter 11) 
support the inclusion of these two genera in one family. 


II. GENOME ORGANIZATION 


At the present time, the available data suggest that the genomes of coro- 
naviruses and toroviruses contain between 6 and 11 functional ORFs (Spaan et 
al., 1988) (Fig. 2). The easiest to define are those encoding the structural pro- 
teins, in particular, the large surface glycoprotein (S), the integral membrane 
protein (M), and the nucleocapsid protein (N), which have a specific size and/or 
location toward the 3' end of the genome (for an explanation of nomenclature, 
see Table III). All coronaviruses contain a fourth structural protein ORF encod- 
ing the sM protein that is invariably located 5' proximal to the M protein ORF. 
Additionally, the genomes of BEV and coronaviruses of the antigenic group 2 
(e.g, HCV OC43, MIN, BCV) contain a fifth structural protein ORF, the 
hemagglutinin-esterase (HE) gene. The location of this ORF in coronaviruses is 
5' proximal to the S protein gene, and in BEV it lies between the M and N 
protein genes. 

The HE gene of coronaviruses and BEV appears to encode a protein with an 
accessory function that is not required for replication in cultured cells. Thus, in 
some coronavirus isolates (e.g., MHV A59) and the single BEV isolate, muta- 
tion, possibly in cell culture, has resulted in the loss of a functional HE ORF and 
conversion of these sequences to pseudogenes (Yokomori et al., 1991; Snijder 
and Horzinek, 1993). | 

In relation to the ORFs that encode nonstructural proteins, the coronavirus 
and torovirus genome is dominated by the RNA polymerase locus which con- 
sists of two large ORFs encompassing approximately 20,000 nucleotides toward 
the 5' end of the genome. The remainder of the coronavirus nonstructural 
protein ORFs present a diverse and complex pattern both in their number, size, 
and arrangement. The reasons for the complexity are probably twofold. First, as 


THE CORONAVIRIDAE 7 








Pol la A MN 
IBV 
Pol la 2 HE M 
Pol 1b 
MHV 
Sa “sM 
Pol la S 4bM 
Pol 1b [] HCV 
4 
: le 229E 
sM N 
Pol la M 


FIGURE 2. The genomic organization of a torovirus (BEV), a group 1 coronavirus (HCV 229E), a 
group 2 coronavirus (MHV|, and a group 3 coronavirus (IBV). The genomic ORFs encoding structural 
proteins are lightly shaded; those encoding nonstructural proteins are filled. The ORFs are drawn to 
scale in the correct relative reading frames. ORF Pol la is defined as reading frame zero. 


mentioned above, many coronavirus genes appear to encode proteins with 
accessory functions. Thus, under certain conditions (e.g., adaptation and propa- 
gation in cultured cells), mutations accumulate that lead to the inactivation 
and even deletion of these genes. Second, not only divergence from a common 
ancestor but also RNA recombination appear to be a major driving force in the 
evolution of coronavirus and torovirus genomes (see Chapter 12). This appears 
to have introduced a plasticity to the corona/torovirus genome that is excep- 
tional, even among RNA viruses. 

The genome maps shown in Fig. 2 should, therefore, be taken as an over- 


TABLE III. Coronaviridae Structural Proteins? 


Gene product Coronavirus Torovirus 
Nucleocapsid protein N  Nucleocapsid N Nucleocapsid 
Large-surface glycoprotein S Surface S Surface 
P Peplomer P Peplomer 
E2 Envelope 2 
Integral membrane protein M Membrane M Membrane 
El Envelope 1 E Envelope 
Small-membrane protein sM Small membrane — 
Hemagglutinin-esterase HE Hemagglutinin-esterase HEps Hemagglutinin-esterase 
glycoprotein E3 Envelope 3 (pseudogene) 
ORF 4 product 


“The nomenclature shown in bold type is recommended. 
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view, and isolates that display differences occur. The reader is also cautioned 
that the nomenclature of the ORFs encoding the nonstructural proteins of 
coronaviruses is not logical. In some cases, for example, the ORF 3a of canine 
coronavirus (CCV) and TGEV, the gene products are homologous, while others 
carrying the same designation, for example, the ORF 3a of IBV, are not. 


IV. MORPHOLOGY AND PHYSICOCHEMICAL PROPERTIES 


Coronaviruses are described as pleiomorphic but roughly spherical enve- 
loped particles, approximately 60 to 200 nm in diameter with a characteristic 
"fringe" of 20-nm-long surface projections. An "inner fringe" of short surface 
projections is sometimes seen on MHV, BCV, TCV, and HCV OC43 particles. 
Toroviruses are also pleiomorphic enveloped particles, although only 120 to 140 
nm in diameter. Toroviruses are disk-, kidney-, or rod-shaped and are likewise 
decorated with 20-nm surface projections. The nucleocapsid of both corona- 
viruses and toroviruses has a helical symmetry, but differs remarkably in its 
morphology. Coronavirus nucleocapsids are extended while those of toro- 
viruses are tubular. These morphological features and the assignment of struc- 


Coronaviruses Toroviruses 


(eg. HCV 229E) 





FIGURE 3. The structural components and schematic morphology of torovirus and coronavirus 
virions. RNA, genomic RNA; cap, a 5' cap structure; AAAA, a 3' polyadenylate tract; N, nucleocap- 
sid protein; M, integral membrane protein; S, large-surface glycoprotein; sM, small-membrane 
protein; HE, hemagglutinin-esterase protein. The stoichiometry of the virion components is shown 
arbitrarily. 
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tural proteins to the virus particles are illustrated for a group 1 coronavirus, a 
group 2 coronavirus, and a torovirus in Fig. 3. In this diagram, the sM protein is 
depicted as a structural protein of group 2 coronaviruses, although this has not 
yet been shown experimentally. 

Coronaviruses have an estimated molecular mass of 400 x 106 and a buoy- 
ant density in sucrose of 1.15 to 1.19 g/cm?. The buoyant density in CsCl is 1.23 
to 1.24 g/cm? and the sedimentation coefficient S20,w is 300 to 500. Corona- 
viruses are sensitive to heat, lipid solvents, nonionic detergents, formaldehyde, 
and oxidizing agents. Toroviruses have a buoyant density of 1.16 to 1.17 g/cm? in 
sucrose and an estimated sedimentation coefficient San, of 400 to 500. Virus 
infectivity is stable between pH 2.5 and 9.7 but rapidly inactivated by heat, 
organic solvents, and radiation. There is no inactivation of infectivity by phos- 
pholipase C or sodium deoxycholate. 

In the last 15 to 20 years many articles that describe the structural, func- 
tional, and antigenic properties of the coronavirus and torovirus proteins have 
been published. However, it was only in 1990 that a standard nomenclature for 
the structural proteins of coronaviruses was introduced (Cavanagh et al., 1990). 
This nomenclature has been adopted throughout this book but, to help the 
reader in the literature, a list of synonyms is presented in Table III. Unfor- 
tunately, there is, as yet, no standard nomenclature for the nonstructural pro- 
teins of coronaviruses or toroviruses. This is partly because there is insufficient 
information on the expression, processing, and function of these gene products, 
but also because of the aforementioned variability in the genomes of corona- 
and toroviruses. In this respect, the reader is advised to consider each virus 
species individually. 


V. CONCLUSION 


The readers of this book will, I hope, appreciate that considerable progress 
has been made in elucidating many aspects of coronavirus and torovirus biology 
in a relatively short time. However, there remain many unanswered questions, 
particularly in areas such as the genesis of corona- and torovirus subgenomic 
mRNAs, the structure and function of the viral nonstructural proteins, and the 
pathogenesis of natural infections. I hope that reading this book may stimulate 
young scientists to address some of these questions. 
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CHAPTER 2 


Coronavirus Replication, 
Transcription, and RNA 
Recombination 


ROBBERT G. VAN DER MOST AND WILLY J. M. SPAAN 


I. INTRODUCTION 


With the advent of recombinant DNA technology, our knowledge of the tran- 
scription and replication processes of positive-strand RNA viruses has in- 
creased profoundly. A major breakthrough in this field has been the develop- 
ment of full-length cDNA clones from which infectious RNA transcripts can be 
made. Such clones have now been constructed for members of the Picorna- 
(Racaniello and Baltimore, 1981), Bromo- (Ahlquist et al 1984), Alpha- (Rice 
et al., 1987) and Flaviviridae (Lai et al., 1991; Rice et al., 1989). Unfortunately, 
the enormous length of the coronavirus genome has hampered the construction 
of a full-length cDNA clone so far. 

In this chapter we will address new data that have been published in the 
field of coronavirus RNA replication, transcription, and recombination. In 
addition, we will discuss how cDNA clones of defective interfering (DI) RNAs 
may provide a promising alternative to study the replication and transcription 
of coronaviruses. Previous reviews that dealt with coronavirus transcription 
and replication appeared in 1988 (Spaan et al., 1988) and 1990 (Lai, 1990). 
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IL. REPLICATION 


Coronaviruses are positive-strand RNA viruses. Thus, upon infection, the 
genomic RNA is translated to produce the RNA-dependent RNA polymerase. 
The RNA polymerase transcribes the viral genome into a complementary RNA. 
The negative-stranded RNA in turn serves as a template for the synthesis of 
genomic RNA. The viral and/or host enzymes involved in the RNA replication 
process of coronaviruses are unknown. The translation product(s) of the huge 
polymerase gene are proteolytically processed; however, the processing path- 
way is not well characterized (Baker et al., 1989, 1993; Denison and Perlman, 
1987; Denison et al., 1991, 1992; Weiss et al., 1994). By analogy with other 
positive-strand RNA viruses (Andino et al 1993; Lemm and Rice, 1993a,b), it is 
thought that processed end-products as well as precursor polypeptides and host 
factors may be involved in the replication process. 

The RNA polymerase must recognize specific signals that control the 
initiation of plus- and minus-strand RNA synthesis. Mapping of these cis- 
acting signals is a prerequisite for our understanding of coronavirus replication. 
Because a full-length cDNA clone of a coronavirus is not available, DI RNAs are 
the tools of choice to study these signals. DI RNAs are replicated in virus-infected 
cells, and must therefore contain the essential cis-acting replication signals. 

During the last few years several natural DI RNAs of murine hepatitis virus 
(MHV) strains JHM (Makino et al., 1985, 1988a, 1990; Makino and Lai, 1989a) 
and A59 (van der Most et al., 1991) have been characterized and used as tem- 
plates to construct full-length cDNA clones. Synthetic DI RNAs can be tran- 
scribed in vitro from these clones and are replicated on transfection into MHV- 
infected cells. The structures of the JHM DIs DIssE and DIssF and the A59 DI 
MIDI are schematically shown in Fig. 1. All three DI RNAs contain the 5' and 3' 
terminal sequences of the viral genome. MIDI is a synthetic copy of the natural 
DI RNA DI-a and consists of three noncontiguous fragments of the viral ge- 
nome, i.e., the 5' terminal 3889 nucleotides (nt), 799 nt derived from the 3' end 
of open reading frame 1b (ORFIb|, and the 3’ terminal 806 nt (van der Most et al., 
1991). Both DIssF and MIDI contain sequences from the 3' end of ORFIb, which 
are required for RNA packaging (Makino et al., 1990; van der Most et al., 1991). 
This packaging signal was mapped as a 61 nt sequence that could be folded into a 
bulged stem-loop structure (Fosmire et al., 1992). It is not yet known whether 
this sequence alone is sufficient for RNA packaging. 

The minimal sequence requirements for replication can be mapped by 
comparing the sequences of the three natural DI RNAs and by deletion muta- 
genesis. Deletion mapping of DIssE, DIssF, and MIDI has demonstrated that the 
3' replication signal comprises the 3’ terminal 447 nt (Kim et al., 19932; Lin and 
Lai, 1993; van der Most and Spaan, unpublished results) (Fig. 1). It has been 
suggested that a short conserved sequence element in the 3' nontranslated 
region of the genomic RNA (5'-CGAAGAGC-3’) acts as a replication signal for 
minus-strand synthesis (Lai, 1990). Clearly, the results from the deletion map- 
ping suggest that this sequence alone is not sufficient and that the signal(s] 
comprise a larger stretch of sequences. However, very recently Lin et al. have 
provided evidence that the 3' terminal 55 nucleotides of the positive strand 
RNA plus the poly(A) tail comprise the cis-acting signal for the initiation of 
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FIGURE 1. Schematic structure of the genomes of MIDI, DIssE, DIssF, and B36. The sequences 
derived from ORFla, ORFIb, and the 3’ end of the genome [N|] are indicated by different shadings. 
The putative enhancer sequence in ORFla and the RNA packaging signal in ORFIb (pack) are 
indicated by black bars. | 


minus-strand synthesis (Lin et al., 1994). This would imply that part of the 3’ 
terminal 447 nt that were identified as the 3' replication signal, are in fact 
required for positive strand synthesis (Lin et al., 1994). Thus, initiation of 
positive strand synthesis could involve interactions between 5'- and 3'-termi- 
nal signals. 

Mapping of the 5' replication signal has yielded conflicting results on two 
points. First, it is unclear which sequences at the very 5' terminus of the 
genome are needed. An artificial MHV-A59 DI RNA, DI-B26, which consists of 
the 5' terminal 467 nt and the 3' terminal 1671 nt (Fig. 1), is replicated in MHV- 
A59-infected cells (Masters et al., 1994). DI-B36 was originally constructed on 
the basis of a bovine coronavirus (BCV) DI RNA (Hofmann et al., 1990; Masters 
et al., 1994), which contains the 5’ terminal 498 nt and which is also replicated 
on transfection into BCV-infected cells (Brian et al., 1994). These data suggest 
that the 5' 467 nt comprise the 5' replication signal. Accordingly, the 5' termi- 
nal 474 nt were found to be sufficient for replication of DIssE (Kim et al., 19932]. 
In contrast, the 5’ terminal 482 nt do not support replication of DIssF (Lin and 
Lai, 1993). Deletion of the sequence between positions 482 and 706 abrogates 
replication of DIssF (Lin and Lai, 1993}, whereas deletion of the same fragment 
from DIssE only reduces replication efficiency slightly (Kim er al., 19934]. 
Preliminary evidence from our laboratory suggests that for (efficient) replica- 
tion of the MIDI-derivatives the 5' 1.1 kb is essential (Luytjes, Gerritsma, and 
Spaan, unpublished results). It is possible that these differences reflect the 
different experimental protocols used. The deletion mapping experiments with 
DIssF and MIDI have involved in vivo transcription of DI RNA by a vaccinia 
virus-encoded T7 RNA polymerase (Fuerst et al., 1986}, in combination with 
DNA transfection (Lin and Lai, 1993; van der Most et al., 1992). Replication of 
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the DIssE mutants has been assessed in MHV-infected and RNA-transfected 
cells (Kim et al., 1993a) and the analysis of DI-B36 has been done with a yet 
different protocol, which involves seeding of RNA-transfected L cells onto a 
monolayer of 17CII cells (Masters et al., 1994]. 

The second issue on which the current data conflict is the finding that 
replication of the JHM Dis DIssE and DlIssF requires an internal sequence 
element of 135 nt (Kim et al 1993a; Lin and Lai, 1993) (Fig. 1). This sequence 
element is located 3 kb from the 5' terminus of the genome and is contained in 
MIDI, but is not present in most of its derivatives leg, AH-in (de Groot et al., 
1992)], nor in DI-B36 (Masters et al., 1994). These DI RNAs are, however, 
replicated, indicating that this presumptive signal is not an absolute require- 
ment for MHV DI replication. It has been suggested that this signal functions as 
a replication enhancer (Kim et al., 1993a), which would explain why RNA 
replication is readily detected in DIssE-transfected and virus-infected cells. 
Clearly, the precise sequence of the cis-acting signals, the extent to which these 
data can be extrapolated to the viral genome, and the role of the putative 
enhancer remain to be elucidated. The sequence elements that have been 
mapped comprise the 5' and 3' nontranslated regions of the genome, as well as 
parts of ORFla and the N gene. To date, it is unclear which proteins interact 
with these replication signals. Several host factors have been shown to bind to 
the 5' terminus of the plus-strand and to the 3' terminus of the minus-strand 
RNA (Furuya and Lai, 1993), and it is possible that these proteins play a role in 
the initiation of RNA synthesis. Further characterization of the enzymology of 
replication will await the reconstruction of a functional polymerase gene or the 
development of an in vitro RNA replication system. 

A peculiar feature of the MHV DI RNAs is that they contain large ORFs. 
For instance, the three fragments of DIssE as well as the three fragments of 
MIDI are joined in-frame (Makino et al., 1988a; van der Most et al., 1991). There 
is now substantial evidence that the presence of large ORFs in the genomes of 
MHV DI particles is biologically significant (de Groot et al., 1992; Kim et al., 
1993b). For DIssE it appears that the ORF is not required in DI-transfected and 
MH V-infected cells, but is important after one passage (Kim et al., 1993b). In 
principle, the requirement for a large ORF could be related to RNA stability (de 
Groot et al., 1992), replication, packaging, or uncoating. The results obtained for 
DIssE may suggest that the large ORF is not required for RNA replication. 
However, these data have not yet been confirmed for MIDI and its derivatives 
(van der Most and Spaan, unpublished results). Furthermore, it is not known if 
these data can be extrapolated to the viral genomic RNA. 


III. TRANSCRIPTION 
A. Introduction 
Coronavirus gene expression involves the synthesis of five to eight sub- 


genomic mRNAs (Lai, 1990; Spaan et al., 1988). These mRNAs are synthesized 
in different but constant molar ratios in the infected cell. As an illustration, 
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TABLE I. MHV Subgenomic mRNAs, 
Coding Assignments, and Relative Molarities 


Relative molarity 


MHV mRNA Protein (percent)? 
mRNAI (genome) RNA-dependent RNA polymerase 10.4 
mRNA2 Nonstructural protein 2a 4.7 
mRNA2-1 (not all strains) HE glycoprotein 

mRNA3 Spike glycoprotein 4.2 
mRNA4 Nonstructural protein 6.5 
mRNAS ORF5a: nostructural protein 13.1 

ORF5b: small membrane protein 

mRNA6 Membrane glycoprotein 15.7 
mRNA7 Nucleocapsid protein 45.5 


4From Jacobs et al. (1981). 


Table I summarizes the MHV mRNAs, their translation products, and their 
molar ratios. The mRNAs form a 3' coterminal nested set, i.e., they have 
identical 3' ends but extend for different lengths in the 5' direction (illustrated 
for MHV in Fig. 2). Only the 5’ most gene(s), which is not present in the next 
smaller mRNA, is translated (Lai, 1990; Spaan et al., 1988) (see Chapter 3). On 
the genome, the transcription units are preceded by short sequence elements, 
termed intergenic regions. The complementary intergenic regions on the 
negative-stranded RNA are thought to function as promoters and will therefore 
be referred to as the "intergenic promoter sequences" or "promoters" in this 
chapter. 

The subgenomic mRNAs not only form a 3’ coterminal nested set, but are 
also 5’ coterminal: all mRNAs contain a common 5’ leader sequence (Lai et al., 
1984; Spaan et al., 1983] (Fig. 2). For different coronaviruses, the length of the 
leader ranges from 65 to 98 nt. The leader sequence is only found at the very 5' 
terminus of the genome, which implies that the synthesis of subgenomic 
mRNAs involves fusion of noncontiguous sequences. To explain the synthesis 
of leader-containing subgenomic mRNAs, several models have been put for- 
ward. For instance, it has been suggested that the mRNAs arise via splicing of 
larger precursor RNAs or by "looping-out" of intervening sequences. The so- 
called leader-primed transcription model proposes that short leader RNAs are 
transcribed from the 3’ end of the genomic negative strand and act as primers for 
subgenomic mRNA synthesis (Lai et al., 1984; Spaan et al., 1983). Because in 
earlier experiments, only genome-length negative-strands were found in MHV- 
infected cells (Lai et al., 1982), it seemed that these molecules served as the 
exclusive templates for transcription. However, the recent discovery of sub- 
genomic negative-strands and double-stranded subgenomic RNAs in corona- 
virus-infected cells has provoked renewed discussion (Hofmann et al., 1990, 
Sawicki and Sawicki, 1990; Sethna et al., 1989): these findings imply that the 
subgenomic mRNAs could also be transcribed from their negative-stranded 
counterparts. 

Much of our knowledge of coronavirus transcription has been obtained by 
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FIGURE 2. Coding assignments and the nested set structure of the MHV subgenomic mRNAs. The 
common 5' leader sequence is indicated by a black box. The 5' most unique ORF(s] of each mRNA 
are indicated by open boxes. The sequences downstream of the leader of the subgenomic mRNAs 
will be referred to as "body" sequences. 


sequencing subgenomic mRNAs and intergenic regions and by analyzing 
negative-stranded RNAs. Furthermore, an experimental system has been set up 
based on synthetic DI RNAs carrying intergenic promoter sequences (Makino 
et al., 1991; van der Most et al., 1994). In the present situation, two basic 
questions can be asked. How are the leader sequences attached to the down- 
stream "body" sequences and which are the templates for mRNA synthesis? 
The next sections will address the experimental evidence that has accumulated 
on these issues. We will focus on the discovery of the subgenomic negative 
strands and its consequences for the current coronavirus transcription models. 
We will also discuss the role and the sequence requirements of the intergenic 
regions and the role of basepairing in determining mRNA abundance. 


B. Negative-Stranded RNAs 


The actual double-stranded structure that "produces" RNA is designated 
the replicative intermediate (RI). Visualization of double-stranded RNAs by gel 
electrophoresis and fluorography usually involves digestion of the excess 
single-stranded RNA by RNase A. The remaining double-stranded RNase- 
resistant molecules are referred to as the replicative forms (RFs). The RFs do not 
always correspond to the RIs because the RIs could also have been digested at 
certain RNase-sensitive sites to yield smaller RFs. 

In 1989, Sethna and co-workers reported the discovery of genomic and 
subgenomic negative-stranded RNAs in transmissible gastroenteritis virus 
(TGEV -infected cells. They identified the negative-stranded RNAs by using 
strand-specific oligonucleotide probes in Northern (RNA) blot analyses. This 
observation was confirmed for BCV (Hofmann et al., 1990). It was subsequently 
demonstrated that the subgenomic negative strands contained an anti-leader 
sequence (Sethna et al., 1991). 

RNase digestion of cytoplasmic RNA isolated from TGEV-infected cells 
yielded a set of RFs corresponding to the subgenomic mRNAs (Sethna et al., 
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1989]. The RFs contained very small quantities of full-length positive-stranded 
RNAs, whereas full-length negative-stranded RNAs were abundant. This sug- 
gested that positive-stranded RNAs were being synthesized from a negative- 
strand template. However, these authors did not provide evidence that the 
subgenomic RFs did in fact correspond to subgenomic RIs. In 1990, Sawicki and 
Sawicki reported the existence of both genomic and subgenomic RFs in MHV- 
infected cells. They showed that each subgenomic RF was derived from a 
corresponding subgenomic RI. The RIs were transcriptionally active: following 
a very short pulse (2 min) with [?H]uridine, the label was predominantly incor- 
porated into the RIs and to a much lesser extent into the subgenomic RNAs. 
After longer pulse periods (up to 300 min|, only labeled subgenomic mRNAs 
could be detected. This was interpreted as an indication that the label "flows 
through" the RIs into the single-stranded RNAs produced from them (Sawicki 
and Sawicki, 1990]. 

The discovery of subgenomic negative strands has led to two different 
models of how coronaviruses produce their mRNAs, which differ in the nature 
of the template(s) for mRNA synthesis. Sethna and co-workers speculated that 
the subgenomic mRNAs might in fact be amplified as independent replicons 
(Fig. 3A). The initial subgenomic mRNA templates for replication could either 
be present in the virion or could be synthesized from the antigenome via leader- 
primed transcription (Sethna et al., 1989]. This model implies that, at least early 
in infection, the genomic RI would yield subgenomic RFs upon RNase digestion 
(Sawicki and Sawicki, 1990). Sawicki and Sawicki could not detect such RFs and 
therefore proposed an alternative model. They suggested that subgenomic nega- 
tive strands transcribed from the genome serve as templates for the synthesis of 
the corresponding mRNAs and not vice versa (Fig. 3B). To date, all attempts to 
obtain direct evidence for mRNA replication by transfecting synthetic sub- 
genomic RNAs into coronavirus-infected cells failed to demonstrate any such 
replication (Brian et al., 1994; Makino et al., 1991; Luytjes and Spaan, unpub- 
lished results]. Thus, several lines of evidence seem to suggest that the mRNAs 
may not undergo replication; instead, the hypothesis that the subgenomic 
mRNA s are transcribed from the subgenomic negative strands has gained some 
momentum (Sawicki and Sawicki, 1990; Hofmann et al., 1993). Alternatively, it 
has been argued that the negative strands are synthesized from the subgenomic 
mRNAs as dead-end products (Jeong and Makino, 1992). It should be noted, 
however, that these conclusions are based on negative results. For instance, the 
failure to directly detect mRNA replication could also be explained by assum- 
ing that transfected subgenomic RNAs are, for some unknown reason, not 
suitable templates for replication. 

Very recently, Schaad and Baric (1994) have provided strong genetic evi- 
dence indicating that, at least late in infection, the subgenomic negative strand 
RNAs are the functional templates for subgenomic mRNA synthesis. They 
studied a temperature-sensitive MHV-mutant, defective in negative strand syn- 
thesis, and found that the subgenomic RFs still incorporated radiolabel after a 
shift to the restrictive temperature. Because minus-strand synthesis was blocked, 
these results strongly suggest that the transcription indeed occurs on sub- 
genomic negative strands (Schaad and Baric, 1994). 
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FIGURE 3. Schematic representation of three transcription models. Plus- and minus-strands are 
indicated by open and by black bars, respectively. On the plus-strands, the leader sequences and 
intergenic regions are indicated by shading; on the minus-strand, the intergenic promoter is shown 
by shading. (A) A genomic minus-strand is the template for subgenomic mRNA synthesis. The 
subgenomic mRNAs are first synthesized by leader-primed transcription and are then amplified. 
(B) Subgenomic minus-strands are synthesized by premature termination of transcription and serve 
as templates for leader-primed transcription. (C) A subgenomic minus-strand is synthesized by 
discontinuous transcription, i.e., the template jump occurs during minus-strand synthesis. The 
subgenomic minus-strand are the templates for subgenomic mRNA synthesis. 


C. Leader-Primed Transcription 


The experiments described above do not address the question of how the 
leader sequences are attached to the body sequences. According to the original 
model of leader-primed transcription, short leader RNAs are transcribed from 
the 3' end of the genomic negative strand, translocated to the intergenic pro- 
moters on the negative strand(s), and then extended (Lai, 1990; Spaan et al., 
1988). A key observation that has been interpreted in favor of this model is that 
the leader contains a complementary promoter sequence (CPS] near its undeter- 
mined 3' end. This would allow basepairing to occur between the 3' end of the 
leader and the promoter, thus facilitating the priming event (Budzilowicz et al., 
1985; Spaan et al., 1983). 

Before the discovery of subgenomic negative strands, alternative transcrip- 
tion models, e.g., cis-splicing of genome-length precursor RNAs, could be ex- 
cluded on the basis of the UV inactivation kinetics of mRNAs (Jacobs et al., 
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1981; Stern and Sefton, 1982}. UV inactivation studies demonstrated that, late in 
infection, the UV target size of each mRNA corresponds to its physical size. 
This indicated that transcription of the mRNAs was independent of the syn- 
thesis of a genome length precursor. Based on the conclusion that subgenomic 
negative strands did not exist (Lai et al., 1982), these data provided strong 
support for the leader-primed transcription model. The discovery of the sub- 
genomic negative strands radically changed the picture, because alternative 
models could no longer be ruled out. For instance, any model in which mRNAs 
are transcribed from subgenomic templates is consistent with the UV transcrip- 
tion mapping. 

One variant leader-primed transcription model, based on the existence of 
transcriptionally active subgenomic RIs, is that a nested set of subgenomic 
negative strands is synthesized first (Fig. 3b). The intergenic promoter se- 
quences act as transcription attenuators in this model. These RNAs then serve 
as templates for leader-primed transcription (Sawicki and Sawicki, 1990). Thus, 
leader priming and transcription from subgenomic negative strands are not 
mutually exclusive. However, this model still assumes the existence of free 
leader RNAs. There are some indications that free leader transcripts do indeed 
exist. Baric and co-workers (1985, 1987) detected short RNA molecules contain- 
ing leader sequences in MHV-infected cells. The biological significance of the 
small RNAs is not clear, though. Baker and Lai (1990) subsequently demon- 
strated that synthetic leader transcripts are incorporated into subgenomic 
mRNAs when added to an in vitro transcription system. An important observa- 
tion was that the CPS at the 3' of the leader was essential. When the CPS was 
deleted, the exogenous leaders were not incorporated. Although these data are 
suggestive, it cannot completely be ruled out that the synthetic leader RNAs 
did not actually prime transcription but merely functioned as templates for 
RNA recombination (Liao and Lai, 1992). The high-frequency leader reassort- 
ment that occurs during mixed infection of MHV strains has also been inter- 
preted in support of the leader-primed transcription model (Jeong and Makino, 
1994; Makino et al., 1986a; Makino and Lai, 1989a; Zhang et al 1991). However, 
leader reassortment does not prove leader priming, but rather indicates that 
there is at least one step in trans in mRNA synthesis. If, for instance, the 
discontinuous transcription step occurs during negative-strand synthesis (and 
not by leader priming) (Fig. 3c), one would still observe “leader reassortment" 
when the (reassorted] negative strands are transcribed to produce mRNAs. 
Interestingly, leader reassortment appears to depend on a 9 nt sequence directly 
downstream of the leader; deletion of this sequence results in an exclusive use 
of the helper virus leader, which is provided in trans (Makino and Lai, 1989; 
Zhang et al., 1994]. The function of the 9 nt sequence is unknown, but it has 
been proposed that it functions as a transcriptional terminator, controlling the 
synthesis of free leader transcripts (Zhang et al., 1994). 

As discussed in Section B, there is now genetic evidence strongly suggest- 
ing that subgenomic minus-strands are the functional templates for sub- 
genomic mRNA synthesis (Schaad and Baric, 1994}. Indeed, one experimental 
approach to determine whether leader-primed transcription occurs on sub- 
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genomic negative strands would be to transfect subgenomic negative strands 
and determine whether subgenomic mRNAs are transcribed from these tem- 
plates. 


D. The Intergenic Promoter Sequence 


The intergenic regions are short sequence elements upstream of the tran- 
scription units. Because the leader- mRNA junction occurs within the inter- 
genic region, this sequence or its minus-sense counterpart, the intergenic pro- 
moter sequence, is considered to be crucial for mRNA synthesis. The potential 
basepairing between the leader CPS and the promoter is one of the cornerstones 
of the leader-primed transcription model. The intergenic promoter sequences 
for different genes in one coronavirus may differ slightly in sequence and in the 
extent of potential basepairing with the leader CPS. For MHV, the extent of 
potential basepairing ranges from 9 to 18 basepairs; every promoter contains the 
sequence 3’ UUAGAUUUSG 5’, or a closely related sequence in the case of the 
promoters for mRNA2, mRNA2-1, and mRNAG (Table II). The consensus se- 
quences of different coronaviruses are quite similar, though different in length 
(Table III). Avian infectious bronchitis virus (IBV) has the most divergent se- 
quence, whereas the consensus sequences of the coronaviruses belonging to 
clusters I and II (Chapter 3] share homology. 

Until recently, many basic features of transcription remained unclear due 
to the lack of an appropriate experimental system. Such issues included the 
minimal sequence requirements for transcription initiation, the role of base- 
pairing in promoter recognition, and the position(s) at which transcription 
starts. An important development has been the use of synthetic MHV DI RNAs 
containing additional intergenic promoter sequences (Makino et al 1991; van 
der Most et al., 1994) (Fig. 4). Makino and co-workers (1991) have inserted the 
mRNA/7 promoter and 0.54 kilobase (kb) of its flanking sequences into the 
genome of DIssF. Transfection of this DI RNA into MHV-infected cells resulted 
in the synthesis of the DI-derived subgenomic RNA. We have cloned short 
oligonucleotides (10—18 nt), comprising only the intergenic promoter sequences 


TABLE II. Sequences of MHV Intergenic Promoter Sequences 


Promoter Sequence (on negative strand) 
mRNA2 3'UUAGAAUAG-5' 
mRNA2-1 A59: 3'-AUUA - -UUCGAA-°-5' 
JHM: 3'-AUUA - -UUUGA A-5' 
mRNA3 3'AUUAGAUUUG-5' 
mRNA4 QJ'UUAGAUUUG-5' 
mRNA5 J'-"GAUUAGAUUUG-5' 
mRNA6 3'UAGAUUAGGUUUGA-5' 
mRNA7 J3'UUAGAUUAGAUUUGAAAU:-5' 


Leader (^ strand)  5-UGUAGUUUAAAUCUAAUCUAAACUUUAUAAA-3' 
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TABLE III. Consensus Sequences for Several 
Coronaviruses from the Three Clusters? 


Antigen 
Virus cluster Consensus sequence 
MHV-A59 I 3'-UUAG (A/G]U(U/A]UG-5' 
BCV I 3'-AG[A/GJUUUG-5' 
FIPV II 3'-UUGAUUUG-5' 
TGEV II 3'-UUGAUUUG-5' 


IBV IIT à'-GAAUUGUU-5' 
?From Spaan et al. (1988). 


and no flanking sequences, and found that these sequences alone suffice to 
direct subgenomic DI RNA synthesis (van der Most et al., 1994). 

In this system, promoter strength has been defined as the ratio between the 
amounts of subgenomic and genomic DI RNAs synthesized in the cell. Inser- 
tion of the mRNA7 promoter with 0.54 kb of flanking sequences into DIssF 
gave a promoter strength of 0.8. Extension or deletion of the non-basepaired 
flanking sequences did not affect promoter strength (Makino and Joo, 1993). 
Deletions within the intergenic region reduced promoter strength: efficient 
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FIGURE 4. Schematic representation of the synthesis of a subgenomic RNA from a DI template. 
The sequences of the DI that are derived from ORFla (la), ORFIb (1b), and the 3’ end of the genome 
(N) are indicated by different shadings. The DI minus-strand is indicated by a black bar. A 17- 
nucleotide promoter sequence is inserted downstream of the N ORF and is indicated as a dotted box 
on both the plus- and minus-strands. The complementary promoter sequence (CPS) in the leader is 
indicated. The 3’ UUAGAUUUG 5' consensus sequence and its plus-sense counterpart are boxed. 
Also indicated are the leader RNAs that originate either from the DI or from the viral genome. 
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subgenomic RNA synthesis requires a promoter larger than the partial con- 
sensus sequence 3' AGAUUUG 5’ {Joo and Makino, 1992; Makino and Joo, 
1993. Makino et al., 1991; van der Most et al., 1994). Thus, the intergenic 
promoter sequence is required for transcription, whereas the flanking se- 
quences are not important. 


E. The Control of Subgenomic mRNA Abundance 


The coronavirus subgenomic mRNAs are synthesized in different quan- 
tities. For instance, in MHV-infected cells, mRNA7 is synthesized in much 
greater quantities than mRNA3 (Table I). An important question is whether the 
determinants that control mRNA abundance reside in the intergenic promoter 
sequence. Both the sequence of the promoter and the extent of potential base- 
pairing with the leader CPS could determine mRNA levels. For MHV there is a 
rough correlation between mRNA abundance and the length of the potential 
duplex. Based on this correlation, it has been proposed that the extent of 
basepairing controls mRNA abundance (Shieh et al., 1987). However, no such 
correlation exists for IBV (Konings et al., 1988) and BCV (Hofmann et al., 1993). 
The relation between the extent of basepairing and mRNA abundance has been 
tested in our laboratory by inserting the 10 nt mRNA3 and 18 nt mRNA7 
promoters into the MIDI genome (van der Most et al., 1994). The "basepairing 
hypothesis" would predict that the mRNA7 promoter would be stronger than 
the mRNA3 promoter in this system. On the contrary, we found that the 
mRNA3 and mRNA7 promoters were of similar strength. In fact, promoter 
activity was even slightly higher for the mRNA3 promoter. Thus, these data 
clearly demonstrate that the extent of basepairing does not control mRNA 
levels. 

The role of the specific nucleotide sequence has been studied by mutating 
single nucleotides in different promoters. Introduction of single substitutions 
into a 13 nt promoter demonstrated that the sequence of the intergenic promoter 
can be very flexible (Joo and Makino, 1992]. Results from our laboratory show 
that promoter strength is affected only slightly when a single nucleotide in a 17 
nt promoter or in the 18 nt mRNA7 promoter is mutated (van der Most et al., 
1994). In contrast, introduction of the same substitutions into the 10 nt mRNA3 
promoter results in a more than tenfold reduction of promoter activity. This 
could suggest that transcription initiation requires a duplex of a certain mini- 
mal stability. Once this condition is met, extending the basepairing does not 
increase promoter strength. However, some of the substitutions in the 17 or 18 
nt promoters did reduce promoter strength, without interrupting the leader- 
promoter duplex because G-U basepairs could be formed. This could indicate 
that recognition of the promoter sequence does not depend on basepairing only. 
Instead, direct interactions between the polymerase and the promoter could 
play a role in promoter recognition. Hofmann et al. (1993) came to the same 
conclusion based on their finding that the leader-body junction of one BCV 
mRNA had occurred on the unusual sequence 3'-CCAUCUG-5' instead of the 
expected sequence 3'-AGGUUUG-5'. 
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The fact that some mutations reduce promoter strength opens the possi- 
bility that the levels of certain mRNAs are downregulated because their pro- 
moters contain point mutations [e.g., the MHV mRNAG promoter) (Table II). 
However, this does not seem to be a general principle. Coronavirus mRNAs are, 
in general, synthesized at a rate inversely related to their length (Hofmann et 
al., 1993; Konings et al., 1988; Sethna er al., 1989). This is similar to observa- 
tions made by French and Ahlquist (1988) who, upon introduction of multiple 
promoter elements into a synthetic RNA3 of brome mosaic virus, found a 
gradient of accumulation favoring the smaller mRNAs. Thus, for coronaviruses 
the transcriptional activity of a promoter may also be (in part) determined by its 
position on the negative-stranded template. This could be accomplished by 
premature termination during plus-strand synthesis, i.e., larger RNA mole- 
cules are more prone to premature termination of transcription (Konings et al., 
1988). Alternatively, premature termination could occur during negative-strand 
synthesis (Sawicki and Sawicki, 1990). This latter idea is consistent with the 
transcription model in which a nested set of subgenomic minus strands is 
synthesized first: larger negative strands are produced in lower quantities be- 
cause they encounter more transcription attenuators (see Section C) during 
their synthesis. However, any such model should take into account that, for 
feline infectious peritonitis virus (FIPV) and TGEV, the shortest mRNAs (RNAs 
7 and 6, respectively) are produced in much lower quantities than the next 
larger mRNA encoding the nucleocapsid protein (de Groot et al 1987; Sethna et 
al., 1989). Direct experimental evidence that mRNA abundance is influenced 
by the presence of additional up- or downstream promoters is not yet available. 


F. Heterogeneity of Subgenomic mRNAs 


For MHV, every intergenic promoter contains the sequence 3’ UUAGA- 
UUUG 5', or a closely related sequence (Table II). The MHV-A59 leader con- 
tains two repeats of the 5' UCUAA3’ sequence (Makino et al., 1988b). As 
mentioned above, the fusion between leader and body sequences occurs within 
the leader-promoter duplex. For most promoters, a perfect duplex is formed; it 
is therefore impossible to map the site(s) at which transcription starts. How- 
ever, some promoters from MHV (e.g., the mRNA6 promoter) and BCV contain 
mutations, and these mutations are transcribed into the mRNAs (Armstrong et 
al., 1984; Hofmann et al., 1993). This indicates that transcription started up- 
stream of the mutations. The same phenomenon was observed in an in vitro 
transcription system (Baker and Lai, 1990]. To explain this, Baker and Lai (1990) 
proposed that 3' terminal sequences of the leader RNA that do not basepair are 
trimmed and that a hypothetical nuclease recognizes mismatches within the 
duplex. By using the substitutions in the DI promoters as markers, it was 
subsequently found that not all mutations were transcribed into the sub- 
genomic mRNAs (Joo and Makino, 1992; van der Most et al., 1994). Mutations 
at the 5’ end of the promoter were always transcribed, but mutations near the 3’ 
end gave rise to two populations of mRNAs: mRNAs that had transcribed the 
mutation and mRNAs that contained the leader sequence (Fig. 5). This sug- 
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FIGURE 5. The use of markers to map the position(s) of transcription reinitiation. The marker is 
indicated by an asterisk. Open bars depict leader sequences. Black bars represent the negative- 
stranded template and the sequences that are transcribed from the template. 


gested that transcription initiates at multiple—possibly random—sites, pre- 
sumably resulting from trimming of the leader; mismatch-specific repair does 
not occur (van der Most et al., 1994). 

A different type of heterogeneity has been described for MHV-A59 mRNA6 
and MHV-]HM mRNAs 6 and 7 (Armstrong et al., 1984; Baker and Lai, 1990; 
Makino et al., 1988b; Shieh et al., 1987]. For these mRNAs, the number of 
UCUAA repeats at the leader-body junction is variable. This was also observed 
for the mRNA7 promoter that had been inserted into MIDI (van der Most et al., 
1994). In this system, the leader, which was derived from MHV-A59, contains 
two pentanucleotide repeats. Two populations of mRNAs were detected: the 
majority of mRNAs contained two repeats, whereas a minor population had 
three repeats (Fig. 6). The heterogeneity has been explained by alternative 
basepairing between these promoters and the leader CPS (Makino et al., 1988b]. 
However, as exemplified in Fig. 6, these promoters contain two (imperfect) 
repeats of the promoter consensus sequence. Each of these individual "pro- 
moter domains" could give rise to a transcript, and these transcripts would 
differ in the number of repeats. This notion was supported by analyzing mutant 
promoters in which the ratio between the two transcripts was altered [van der 
Most et al., 1994). 

The number of pentanucleotide repeats in the leader CPS differs for the 
various MHV strains, e.g., MHV-A59 has two repeats in the leader, whereas the 
MHV-2c leader contains four repeats (Makino et al., 1988b]. During passage of 
MHV-JHM, the number of pentanucleotide repeats changes from three to two 
(Makino and Lai, 1989b). The differences in the number of UCUAA repeats of 
the JHM leader affect the transcription of certain subgenomic mRNAs. For 
example, transcriptional activity of the mRNA2-1 promoter of MHV-]HM 
(which encodes the hemagglutinin esterase gene) depends on the number of 
UCUAA repeats in the leader CPS: mRNA2-1 is only synthesized when the 
leader CPS contains two UCUAA repeats and not when three repeats are 
present (Shieh er al., 1989). Additional examples of differential transcription 
have been described recently (La Monica et al 1992). There is as yet no explana- 
tion for the differential activity of these promoters. This phenomenon has been 
exploited to demonstrate that the transcription mechanism acts in trans. The 
mRNA2-1 promoter was inserted into a DI RNA with a three-repeat leader; this 
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FIGURE 6. Heterogeneity at the leader-body fusion sites of subgenomic RNAs, illustrated for the 
17 nt promoter inserted in MIDI. The structure of the DI genome is shown in Fig. 3. The UCUAA 
repeats in the leader are indicated. Basepairing is indicated by asterisks. (A,B) Synthesis of sub- 
genomic RNAs containing two or three UCUAA repeats. (C) Schematic view of the overlapping 
promoter domains PD-1 and PD-2. 


leader does not support transcription from the inserted promoter. When the 
helper virus contained a two-repeat leader, a subgenomic DI RNA was synthe- 
sized. Because RNA recombination could be excluded, these data indicate that 
the transcription process occurs in trans (Jeong and Makino, 1994). 


IV. RNA RECOMBINATION 


Homologous RNA recombination has been demonstrated for picorna- 
viruses (Cooper, 1968; King et al., 1982), bromoviruses (Allison et al., 1990; 
Bujarski and Kaesberg, 1986) and coronaviruses (Lai et al., 1985). During a mixed 
infection of two MHV strains, RNA recombination occurs at a high frequency 
(Lai et al., 1985; Makino et al., 1986b). High-frequency RNA recombination 
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does not only occur in tissue culture but also in infected mouse brains (Keck et 
al., 1988a). Based on the analysis of a set of temperature sensitive (ts) mutants, 
the frequency of recombination for the entire MHV genome has been estimated 
to be 25% (Baric et al., 1990). Recombinant murine coronaviruses have been 
isolated by employing different selection schemes, such as rescue of ts lesions 
(Baric et al 1990; Lai et al 1985; Makino et al., 1986b), resistance to neutraliz- 
ing monoclonal antibodies (Makino et al., 1987), and rescue of a fusion-negative 
MHV variant (Keck et al., 1988b). Many of these recombinant viruses had 
undergone additional recombination events outside the selection markers, 
again illustrating the high frequency of recombination (reviewed in Lai, 1992). 
At present, itis not known how widespread homologous RNA recombination is 
among other coronaviruses. The only other coronavirus for which recombina- 
tion in vivo has been observed is IBV (Kusters et al 1990). 

Although coronavirus RNA recombination was first thought to take place 
at certain “hot spots” (Banner et al., 1990}, it was later recognized that recom- 
bination is almost random and that these previously identified hot spots re- 
sulted from selection of recombinant viruses (Banner and Lai, 1991). Corona- 
virus RNA recombination most likely occurs via template switching: the 
polymerase and the nascent RNA strand dissociate from the original template 
and anneal to a different viral RNA molecule after which RNA synthesis 
continues. Experimental evidence for this copy-choice mechanism has only 
been obtained for poliovirus recombination (Kirkegaard and Baltimore, 1986}, 
but it is assumed that this mechanism accounts for all examples of homologous 
RNA recombination (reviewed in Jarvis and Kirkegaard, 1991; Lai, 1992). 

The high-frequency RNA recombination that occurs in MHV-infected cells 
has been exploited to introduce mutations into the viral genome (Koetzner et 
al., 1992; Masters et al., 1994; van der Most et al., 1992). Recombination be- 
tween the viral genome and co-replicating synthetic DI RNAs yielded recombi- 
nant viruses carrying DI sequences. Recombinant viruses were isolated in the 
absence of any selection or by rescuing a MHV-A59 ts mutant, Albany-4 (van 
der Most et al., 1992). The ts phenotype of Albany-4 results from a deletion in 
the N gene which is repaired by recombination with DI-borne N sequences. The 
synthetic RNAs need not be replicating: Albany-4 was also rescued by using a 
synthetic mRNA7 transcript (Koetzner et al., 1992), which is most probably not 
amplified on transfection (Brian et al., 1994; Makino et al., 1991; Luytjes and 
Spaan, unpublished results). However, when a replicating RNA was used, the 
frequency with which recombinants are isolated is more than 100-fold in- 
creased (Masters et al., 1994). 

The copy-choice model of recombination proposes that transcription initi- 
ates on one template (the “donor” template) and jumps to another template (the 
“acceptor” template) after which RNA synthesis proceeds (Jarvis and Kirke- 
gaard, 1992). The number of recombinants that are generated depends on the 
concentrations of both parental RNAs: the concentration of donor template 
determines the number of initiation events, whereas the frequency of template 
switching depends on the concentration of acceptor template. This explains 
why the recombination frequency was increased more than 100-fold when a 
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replicating DI RNA was used instead of the mRNA7 transcript (Masters et al., 
1994]. 

As long as a full-length "infectious" cDNA clone is not available, the only 
way to introduce genetic changes into a coronavirus genome will be through 
homologous RNA recombination. A more general applicability of this approach 
depends on several factors. It will be important to optimize transfection proto- 
cols and to investigate which sequences can be cloned into a DI RNA without 
destroying its capacity to be replicated. Recent results from our laboratory show 
that a large part of the MHV S gene can be inserted into a DI genome, although 
the resulting DI RNA replicates less efficiently than the parental DI RNA. One 
potential limitation of this approach is that introduction of mutations into 
more internal regions of the genome would require double recombination 
events. So far, mutations have only been introduced at the 5' and 3' regions 
of the genome. Also, in the case of mutations that decrease viral fitness, screen- 
ing for recombinant viruses will be tedious. Presumably, such difficulties may 
be solved by developing efficient screening protocols and by applying selection, 
e.g., via rescue of ts lesions or by using monoclonal antibodies. Studies to 
address these issues are undoubtedly in progress. 
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CHAPTER 3 


Coronavirus Gene Expression 


Genome Organization and 
Protein Synthesis 


WILLEM LUYTJES 


I. INTRODUCTION 


Recent studies on coronaviruses have provided detailed insights into the ar- 
rangement of open reading frames (ORFs) in the coronavirus genome and the 
strategies used to translate these ORFs. This chapter will present an updated 
and detailed overview of coronavirus genomic organization as well as of the 
coding assignments of the coronavirus ORFs (genes) and their translation strat- 
egies (see also the general reviews on coronaviruses by Spaan et al., 1988; Lai, 
1990; Holmes, 1991). 

Early studies, based on gel electrophoresis of RNA, estimated the length of 
the coronavirus genome to be about 18 kilobases (kb). It was not until a com- 
plete cDNA library was cloned and sequenced for coronavirus avian infectious 
bronchitis virus (IBV) (Boursnell et al., 1987) that the size of the genome could 
be established at around 30 kb, the largest among RNA viruses. The genomic 
RNA of three other coronaviruses has since been completely sequenced, and for 
several other coronaviruses partial sequence data have been obtained. 

From this wealth of sequence information a general picture of coronavirus 
genome structure emerges, but many significant differences exist. Based on 
these differences, three groups of structurally related coronaviruses (named 
after the boldfaced member) can be distinguished: (1) murine hepatitis virus 
(MHV), bovine coronavirus (BCV}, human coronavirus OC48 (HCV-OC43), and 
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turkey coronavirus (TCV); (2) transmissible gastroenteritis virus (TGEV|, feline 
infectious peritonitis virus (FIPV], canine coronavirus (CCV|, porcine epidemic 
diarrhea virus (PEDV), and HCV-229E; and (3] IBV. 

Molecular studies have also provided new insights into translation strate- 
gies of coronavirus genes. This has led to the realization that coronaviruses 
possibly possess as wide a variety of translation strategies as can be found in the 
entire virus kingdom. 

In this chapter, we will refer to coronavirus ORFs and mRNAs according to 
the recommendations made by the coronavirus study group in 1989 (Cavanagh 
et al., 1990). Thus, mRNAs are designated by a number according to decreasing 
size, genomic RNA being 1, and ORFs are numbered corresponding to the 
mRNA from which they are expressed. In the literature variants, isolates, and 
serotypes of coronaviruses are described without any clear consensus. To avoid 
confusion we will refer to such viruses as variants. 


A. General Genome Structure 


Positive-strand coronavirus RNAs (genome and messenger RNAs) are 
capped and polyadenylated. At both ends of the genome, regions of nontrans- 
lated sequences (NTR) are present that are most likely involved in replication 
(see Chapter 2). By virtue of their nested set structure, the 3'-NTR [200-500 
nucleotides (nt}] is common to all coronavirus positive-strand RNAs. Colinear- 
ity between positive-strand RNAs is also present at the 5'-NTR but is restricted 
to the leader sequence. 

Leader sequences and 3'-NTR sequences are highly similar between corona- 
viruses from the same structural cluster, but diverge in sequence and in length 
between these clusters. In fact, only one short stretch of nucleotides, 5'- 
GGAAGAGC-3', is completely conserved in the 3'-NTR of all coronaviruses. 
This region, first detected by Lapps et al. (1987], is believed to be important for 
minus-strand RNA synthesis. 

The arrangement of the elementary genes (those that are essential for 
replication and virion structure) in the genomic RNA is the same for all corona- 
viruses studied to date (Fig. 1). The polymerase/replicase and associated func- 
tions are encoded by the 5'-two thirds of all coronavirus genomes. The major 
structural proteins, i.e., those that are found in all coronaviruses, are encoded 
by genes located in the order: 5'-S (large surface protein gene], -sM (small 
membrane protein gene], -M (membrane protein gene], -N (nucleocapsid protein 
gene), -3'. Depending on the coronavirus, additional ORFs are present at differ- 
ent positions in the genome. The functions of the proteins encoded by these 
ORFs are mostly unknown. 

Coronaviral protein synthesis requires the synthesis of mRNAs by a mech- 
anism described in Chapter 2. Briefly, short conserved stretches of nucleotides, 
previously called intergenic sequences or junction sequences but which in fact 
are mRNA promoters on the minus strands, separate coding regions on the 
genome. On the minus strands, the promoter upstream of a coding region 
initiates transcription of a mRNA containing this coding region and all se- 
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FIGURE 1. Overview of the coronavirus genome organization. The open reading frames of the five 
common elementary genes (see text} are shown. The shaded area in the S ORF represents the 
hypervariable region in this gene. Abbreviations: POL, polymerase gene; S, spike or large-surface 
protein gene; sM, small-membrane protein gene; M, membrane protein gene; N, nucleocapsid 
protein gene; la, ORF la of the POL gene; 1b, ORF 1b of the POL gene; L, leader RNA; (, cap; AAA, 
poly-A tail; >, promoter sequence (also called intergenic or junction sequence]. Only promoter 
sequences common to all coronavirus genomes are shown; I-IV, regions with additional ORFs, 
described in Figs. 2—5 and in the text. 


quences downstream. However, only the first coding region, which may com- 
prise several ORFs on the mRNA, is translated. 

The evolution of coronavirus genomes appears to involve extensive dele- 
tions or insertions in the genomic RNA. This can be deduced from the fact that 
differences between coronavirus genomes are characterized by the number and 
position of nonstructural genes. Also, differences between related, present-day 
coronaviruses mainly involves insertions or deletions in the genome. In those 
cases where ORFs are involved, these genomic changes can be associated with 
different pathogenic properties of the viruses. This is especially true for the 
large-surface protein gene. 


B. General Translation Strategy 


Coronavirus mRNAs are similar to host mRNAs insofar as they supply all 
the structural prerequisites for recognition by the eukaryotic host translation 
machinery. The initiation of translation in eukaryotes is generally accepted to 
occur through ribosomal scanning, starting at the 5'-cap structure and proceed- 
ing until an AUG codon in the preferred context is found (Kozak, 1989). Protein 
synthesis then continues until a termination codon is encountered and the 
ribosome complex detaches from the RNA and dissociates. 

All coronavirus mRNAs possess a common leader sequence that provides a 
cap structure. Therefore, by means of ribosomal scanning, an ORF encountered 
in the mRNA coding region can be translated. Indeed, early in vitro translation 
studies on coronavirus mRNAs that encode readily detectable structural pro- 
teins confirmed that only the 5'-extreme ORF, which on these mRNAs com- 
prised the entire 5’ coding region, was translated (Rottier et al., 1981; Siddell, 
1983). It was thus concluded that coronavirus mRNAs were translated accord- 
ing to the ribosomal scanning model and, although containing more than one 
coding region (as a consequence of the discontinuous transcription of corona- 
virus mRNA, see Chapter 2}, were functionally monocistronic. However, it 
soon became apparent that some coronavirus mRNAs were quite different. In 
these cases, the 5' coding region contained more than one translationally active 
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ORF. It has thus become clear, as will be described in this chapter, that corona- 
virus mRNAs can be functionally monocistronic or polycistronic. 

Little is known about translational regulation of coronavirus mRNAs. For 
the functionally monocistronic mRNAs, the levels of protein expressed will 
probably mainly depend on the amount of mRNA, which is transcriptionally 
regulated (Chapter 2). However, there are some indications that the leader 
sequence may play, at least to some extent, a role in determining translation 
levels. Hofmann et al. (1993) have found that during persistent infection of BCV 
in tissue culture cells an intraleader ORF is selected. This may downregulate 
the expression of the POL gene, resulting in persistency. Recently, Tahara et al. 
(1994) have shown that the MHV leader sequence is able to stimulate viral 
translation in cis, probably in conjunction with a virus-specified or virus- 
induced factor. Structural features such as hairpins, often implicated in transla- 
tional control, can be observed in the leader sequence of coronavirus mRNAs; 
but since these are common to all mRNAs, they will not play a role in deter- 
mining relative protein levels. Whether the length of the 5'-NTRs on the 
different mRNAs, which can vary considerably, especially between corona- 
viruses, plays any role in determining translation efficiency remains to be 
defined. 

Translation regulation mechanisms are more evident for the functionally 
polycistronic mRNAs. The best studied of these mechanisms is ribosomal 
frameshifting, occurring in the POL mRNA. Also, leaky scanning and internal 
entry by ribosomes have been proposed for the translation of other coronavirus 
ORFs and these mechanisms will be discussed in more detail. 


II. CORONAVIRUS GENOME: CODING ASSIGNMENTS AND 
TRANSLATION STRATEGY OF CONSERVED GENES 


A. The Polymerase (POL) Coding Region 


The coronavirus genome is infectious; thus the proteins responsible for 
replication and mRNA transcription must be translated from the incoming 
genomic RNA. The polymerase coding region is at the 5'-end of every corona- 
virus genome. In IBV, MHV-JHM, MHV-A59, and HCV-229E, it comprises two 
overlapping reading frames, ORF la and 1b. In other coronaviruses, this is 
probably also true, but sequence data are only available for the viruses men- 
tioned. The POL gene of IBV is 19,887 nt (Boursnell et al., 1987), that of MHV- 
JHM 21,798 nt (Lee et al., 1991), that of MHV-A59 21,530 nt (Bredenbeek et al., 
1990b; Bonilla et al., 1994), and that of HCV-229E is 20,274 nt in length (Herold 
et al., 1993). 

Soon after the sequence of ORF la and 1b of IBV was determined it was clear 
that translation of ORF Ib could not be explained by ribosomal scanning, which 
would lead to translation of ORF 1a only. It was noticed, however, that the 
configuration around the 42 nt overlap of the two polymerase ORFs resembled 
that in retroviruses where ribosomal frameshifting occurs. Indeed, Brierley et 
al. (1987) could show in vitro, using SP6 transcripts of IBV cDNA covering the 
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frameshift region and flanking reporter genes, that two overlapping reading 
frames could be translated to a transframe protein by (—1) ribosomal frameshift- 
ing. It was later established that the same is true for MHV-A59 (Bredenbeek et 
al., 1990b) and HCV-229E (Herold et al 1993). This frameshifting event is 
mediated by a RNA-pseudoknot structure which causes slippage of the ribo- 
somes on a stretch of adenines and uridines (UUUAAAC) just upstream of the 
pseudoknot structure (Brierley et al., 1989). The pseudoknot is essential for this 
event: it could not be replaced by other stem-loop structures and mutations 
were only allowed when they were compensated on the complementary arms of 
the pseudoknot stems (Brierley et al., 1991). From these experiments it can be 
inferred that POL ORF lb is translated in vivo by ribosomal frameshifting (see 
Chapter 10). 

The POL ORFs have a coding capacity for an extremely large replicase/ 
transcriptase protein, which is most likely extensively cleaved. Comparison of 
the complete sequences of the POL coding regions of IBV, MHV, and HCV-229E 
and short stretches of 3'-ORF 1b sequence obtained for other coronaviruses 
revealed that ORF Ib is highly conserved. However, between coronaviruses, 
ORF la is only moderately conserved at the 3'-end. Probably, ORF 1b encodes 
general coronavirus polymerase functions, while ORF la may contain more 
specific functions, for example, those related to replication in the specific host 
cell. 

In vitro translation studies of purified gnomic RNA of MHV produced a 
28-kDa and a 220-kDa protein from a 250-kDa precursor. This cleavage was 
sensitive to the protease inhibitor leupeptin (Denison and Perlman, 1986, 1987). 
The 28-kDa protein could be identified as the N-terminal cleavage product of 
ORF la (Soe et al., 1987). Baker et al. (1989, 1993) subsequently showed that this 
protein is the product of a cis-acting papainlike cysteine protease activity, with 
Cys-1137 and His-1288 as catalytic residues. 

Using a panel of antibodies raised against bacterial fusion proteins and anti- 
peptide sera derived from the 5'-6.5 kb of ORF la of MHV-A59, a number of 
processing products were detected in infected cells (Denison et al., 1992; Weiss 
et al., 1994). These proteins were mapped to ORF la in the order, p28-p65-p50- 
p240, the latter two being cleavage products of a p290 precursor. Cleavage of p28 
occurred early and was insensitive to the protease inhibitor leupeptin, which is 
in contrast to the in vitro data. The p290 cleavage is sensitive to leupeptin, 
suggesting that two distinct proteases are involved in processing this part of the 
ORF 1a product. | 

At the other end of the POL gene, Bredenbeek et al. (1990c) identified a 33- 
kDa protein among the in vitro translation products of the 3'-most 1.85 kb of 
the MHV-A59 ORF Ib. They used antibodies raised against a peptide consisting 
of the C-terminal amino acids of ORF Ib. Denison et al. (1991) identified several 
products of ORF 1b of MHV-A59 using antibodies raised against fusion proteins 
containing sequences from the 3'-2 kb of this ORF. Among the in vitro transla- 
tion products of purified genomic RNA, five different polypeptides could be 
detected: p90, p74, p53, p44, and p32. The latter possibly corresponds to the 33- 
kDa protein found by Bredenbeek et al. (1990a). As for the in vitro data on ORF 
la products, the fact that protease inhibitor did not affect cleavage of the ORF 1b 


38 WILLEM LUYTJES 


in vitro products may indicate that care should be taken in extrapolation to the 
in vivo situation. For IBV, Brierley et al. (1990) have found several POL products 
in virus-infected cell lysates using antibodies directed to fusion proteins con- 
taining parts of the POL ORFs. 

As yet, no function can be assigned to any of the polypeptides detected, nor 
is it known how the ORF 1a/1b products relate to the large polymerase precur- 
sor. As was shown in the one case for MHV by Baker et al. (1989, 1993], virus- 
encoded proteases, encoded somewhere in the POL ORFs, are believed to be 
responsible for the processing of the POL precursors. Computer analyses of the 
polymerase protein sequences have led to the prediction of several putative 
protease domains in the POL protein, as well as potential protease cleavage sites 
(Gorbalenya et al., 1989, 1991; Lee et al., 1991; see Chapter 10). 


B. The Large-Surface (S) Protein Gene 


The largest structural gene common to coronaviruses is the large-surface 
protein or spike protein gene. The S protein is the most prominent protein on 
coronavirions. Anchored in the viral envelope, it mediates recognition of and 
binding to the receptor on the host cell and fusion with its membrane. In the 
TGEV and IBV clusters, S is translated from mRNA 2, and in the MHV cluster 
from mRNA 3. The S ORF encodes a signal sequence, and its translation will, 
thus, direct ribosomes to the endoplasmic reticulum, allowing direct insertion 
into the membrane. The S gene has been cloned for most coronaviruses and 
expressed using a variety of systems. Functions and biological properties of the 
S protein will be discussed in Chapter 5. 

The S gene of coronaviruses from the different clusters varies in size from 
3675 to 4350 nt. Size differences have been mapped to the 5'-part of the gene (De 
Groot et al., 1987). Large size differences have also been located in the 5'-region 
of the S genes encoded by closely related coronaviruses (TGEV/PRCV: Ras- 
schaert et al., 1990; Britton et al., 1991; Wesley et al., 1991), by related strains 
(MHV-A59/THM: Luytjes et al., 1987; Schmidt et al., 1987; other MHV strains, 
Parker et al., 1989b; BCV: Zhang et al., 1991a) and by variants (La Monica et al., 
1991). Apparently, this part of the coronavirus genome is highly variable, prob- 
ably since it encodes parts of the S protein associated with host cell recognition 
and the immune response. The 3' part of the S gene is more conserved between 
coronaviruses and contains information necessary for the structure of the spike 
oligomers (De Groot et al., 1987]. 


C. The Small-Membrane (sM) Protein Gene 


The small-membrane protein gene of coronaviruses has only recently been 
recognized as a structural protein gene. In 1990, Smith et al. reported the first 
observation that a small protein encoded by IBV mRNA 3 was membrane- 
associated. Subsequently, it was established that this protein (then designated 
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3c) is associated with the viral envelope, and thus represents a new structural 
(membrane) protein of coronaviruses (Liu and Inglis, 1991). In TGEV, a protein 
with similar properties was detected [Godet et al., 1992) and the name "small- 
membrane protein" was suggested. Recently, Yu et al. (1994) have identified a 
homologous protein in the virus particles of MHV-A59. The sM protein will be 
discussed in Chapter 9. 

It now seems clear that the sM ORF will be present in all coronaviruses. 
The ORF is invariably located directly upstream of the M ORF. Amino acid and 
nucleotide homology between the different sM ORFs is limited, but the general 
predicted structure is highly similar. The strategy used for the translation of the 
sM ORF is very different between the members of the three coronavirus clus- 
ters. In IBV, the sM ORF is the third ORF in the 5’ coding region of the 
tricistronic mRNA 3. Using synthetic mRNAs, in which the influenza virus 
nucleoprotein (NP) ORF was placed in front of the three IBV mRNA 3 OREs, 
only NP and sM proteins were translated in vitro. When the region containing 
the 3a and 3b ORFs was deleted, no translation of sM was observed (Liu and 
Inglis, 1992b). Thus, it was concluded that the sM ORF is translated by internal 
ribosomal entry in the region containing the 3a and 3b ORFs. Internal ribosome 
entry sites (IRES) are characterized by a specific RNA structure: the ribosomal 
landing pad (RLP). Such an RLP structure has been predicted by Liu and Inglis 
(1992b), and recently Le et al. (1994) have suggested an alternative folding of the 
upstream RNA into an RLP structure, similar to that found in picornaviruses. 
Interestingly, unlike picornaviruses, the IBV RLP structure comprises func- 
tional ORFs (3a and 3b, see Section III.B.). Also, the coronavirus IRES would be 
unique in that it occurs on a polycistronic RNA. 

The sM ORF in MHV is located as the second, downstream ORF on mRNA 
5 (see Fig. 3). The sM ORF product was detected in infected cells and in virions 
(Leibowitz et al 1988; Yu et al., 1994). In vitro, the sM ORF is translated much 
less efficiently than the upstream 5a ORF. However, recent in vitro translation 
data indicate that expression of the MHV sM ORF may also be mediated by 
internal ribosome entry (Thiel and Siddell, 1994). This would be similar to the 
situation in IBV. 

In BCV and OC43, sM is expressed from monocistronic mRNAs 5-1 and 5, 
respectively. The sM ORF product could be detected in BCV-infected cells by 
immunofluorescence using antiserum raised against the MHV-A59 sM protein. 
The BCV sM is similar, 50-60% at the amino acid level, to its MHV counterpart 
(Abraham et al., 1990; Woloszyn et al., 1990). It will be interesting to establish 
whether BCV harbors an IRES in the mRNA 5 coding region. If not, the se- 
quence differences between these viruses might reflect the position of essential 
domains in the MHV RLP. 

In the members of the TGEV cluster, the sM gene is expressed from mono- 
cistronic mRNA 4 (equivalent to HCV-229E mRNA 5). The size of the gene is 
similar to those found in the members of the MHV cluster. An interesting topic 
for future research will be to investigate why only some coronaviruses require 
such an elaborate mechanism of translation for the sM protein. It may be that 
adaptation to the host is involved. 
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D. The Membrane (M) Protein Gene 


The membrane-associated protein is translated from the M gene. The M 
protein assumes a tightly membrane-associated conformation which is be- 
lieved to be crucial for virion structure formation. The properties of the M pro- 
tein will be discussed in Chapter 6. The predicted membrane topology of the M 
protein is highly conserved between coronaviruses, and the gene does not show 
any large insertions or deletions. Yet, M protein sequences between viruses 
from different clusters are only moderately similar. The M gene ranges in size 
from 675 to 786 nt. The main area of divergence is at the 5'-end, where the M 
genes of the TGEV cluster encode signal sequences that target M to the ER 
(Kapke et al., 1988]. In some variants of IBV and some enteric strains of MHV 
differences were also observed in this region (Cavanagh and Davis, 1988; Hom- 
berger, 1994]. 

Surprisingly, the M gene sequence of the avian coronavirus TCV is almost 
identical (only two nts are different) to that of mammalian coronavirus BCV 
(Verbeek and Tijssen, 1991), a level of sequence similarity not found between 
any other two coronaviruses. 


E. The Nucleocapsid (N] Protein Gene 


The nucleocapsid gene encodes the protein that encapsidates the genomic 
RNA. It is translated from the most abundant viral mRNA, which in most 
coronaviruses is the smallest: mRNA 7 in viruses from the MHV cluster and 
mRNA 6 in those from the IBV cluster. Coronaviruses from the TGEV cluster 
express the N gene from mRNA 6, but these viruses, excepting HCV-229E and 
PEDVY, also express a smaller mRNA (see Section IILD.). 

The amino acid similarity of the N proteins between coronaviruses from 
the different clusters is low. The N genes of coronaviruses from a single cluster 
are generally of similar size. However, the PEDV N gene is 135 nt larger than 
that of the other members of this cluster (Bridgen et al., 1993). The authors 
suggest that the extra sequence may be the evolutionary result of polymerase 
stuttering, since it contains a 36 nt (12 amino acid) periodicity. From an evolu- 
tionary point of view, TCV is again noticeable: its N gene has merely a single nt 
difference (including the 3'-NTR) from the same area in BCV (Verbeek and 
Tijssen, 1991). 

Interestingly, in coronaviruses from the MHV cluster, i.e., BCV (where it 
was first recognized by Lapps et al 1987), TCV, SDAV (Kunita et al 1993), and 
MHV strains A59, S, 1, and 3, but not JHM (Parker and Masters, 1990], and in 
those from the TGEV cluster, i.e., PEDV and HCV-229E (Schreiber et al 1989; 
Bridgen et al., 1993], an internal ORF of considerable length (612—660 nt in the 
MHV cluster; 336 nt in the TGEV cluster) is present in the N gene with an AUG 
codon in a favorable context. It has been shown by Senanayake et al. (1992) that 
this ORF is expressed in BCV-infected cells and encodes a membrane-associated 
protein. Thus, BCV mRNAT is functionally bicistronic, probably by means of a 
ribosomal leaky scanning mechanism. In HCV-OC43, the homologue of the 
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internal ORF of BCV is shorter because of a point mutation (Kamahora et al., 
1989). Recently it has been suggested by Schaad and Baric (1993) that additional, 
short internal ORFs in the N gene of MHV might exist which are preceded by 
promoter-similar sequences that would produce subgenomic mRNAs. This 
observation remains to be confirmed as it could reflect aberrant initiation 
events. More detailed information on the coronavirus N protein is presented in 
Chapter 7. 


II. CORONAVIRUS GENOME: CODING ASSIGNMENTS AND 
EXPRESSION OF NONCONSERVED GENES 


A. ORFs between the POL and S Genes: Region I 


In the MHV cluster of coronaviruses, two ORFs, which are not present in 
members of the TGEV and the IBV clusters, are located between the polymerase 
and the large-surface protein gene (Fig. 2). The first of these, ORF 2a, is trans- 
lated from mRNA 2 and encodes a small protein designated ns2a or 30 kDa. 
This protein has been identified as a phosphoprotein in BCV (Cox et al., 1991). 
Early in vitro translation experiments showed a product of MHV mRNA 2 of 
approximately 30,000 molecular weight (Leibowitz et al., 1982; Siddell, 1983). 
Using antibodies raised against a fusion product of the cloned gene of MHV- 
A59, the protein could be detected in cells infected with either MHV-A59 or 
MHV-JHM but not in purified virions, suggesting a nonstructural nature (Bre- 
denbeek et al., 19902; Zoltick et al., 1990). ORF 2a has been sequenced for MHV 
strains A59 (Luytjes et al., 1988), JHM (Shieh et al., 1989), and for BCV (Cox 
et al., 1989). In the MHV-JHM variant Wb] a large deletion has eliminated the 
entire 5'-end of ORF 2a, including the mRNA2 promoter. Consequently no 30- 
kDa protein is synthesized (Schwarz et al., 1990). This deletion does not affect 
growth in tissue culture, suggesting that the 30-kDa protein may only be 
important to the virus in its natural host. 

The second ORF in this region, exclusive to viruses of the MHV cluster, 
encodes a protein that constitutes the major difference in virion structure 
between the MHV, TGEV, and IBV groups. It has been designated HE (hemagglu- 
tinin esterase) in analogy with the influenza C virus hemagglutinin- esterase- 
fusion (HEEL, with which it shows a high degree of structural and functional 
homology (Luytjes et al., 1988; Vlasak et al., 1988a,b). The HE gene is translated 
from mRNA 2-1 and encodes a 65,000 molecular weight protein, which in its 
dimeric form is the second spike protein for several viruses from the MHV 
cluster. HE translation is signal sequence mediated, and the protein is inserted 
into the ER membrane through a transmembrane domain which may be the 
homologue of the HEF fusion domain. It has been hypothesized that the HE 
gene has been acquired by a RNA-recombination event between ancestral 
coronaviruses and influenza C viruses (Luytjes et al., 1988]. 

The HE locus in MHV is highly heterogeneous between strains, but also 
within a single strain. In MHV-A59, where it was first detected, the HE gene is 
not expressed, since it lacks an initiation codon (Luytjes et al., 1988). More 
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FIGURE 2. Genetic structure of coronavirus RNA in region I between the POL and S genes. The left 
panel indicates the coronavirus; the right panel gives a diagram of the genomic structure. Boxes 
represent reading frames. Functional open reading frames are in white boxes, stippled boxes show 
the outline of nontranslated reading frames. The positions of the ORFs are represented relative to 
that of the upstream flanking, conserved ORF (names shown in triangles). Deletions are indicated 
by A. ND = not determined. In the upper, shaded box the nomenclature of the ORFs is displayed (see 
text). Variants of MHV-1/2/3 are indicated above their respective reading frames. Other legends are 
as described in Fig. 1. 


important, mutations have destroyed the upstream promoter that determines 
the synthesis of mRNA 2-1. In other MHV strains the HE gene is truncated at 
the 3'-end by mutations and insertions/deletions that have introduced transla- 
tion termination codons and thus preclude membrane insertion. The gene for a 
full-length HE protein is present in MHV strains S and JHM, although the HE 
protein was only detected at low levels in MHV-JHM virions (Shieh et al., 1989; 
Yokomori et al., 1991). Single-strain heterogeneity is indicated by the observa- 
tion that MHV-JHM(2) variants, which contained HE regions truncated to dif- 
ferent extents, could be isolated over a period of time from the brains of infected 
animals (Yokomori et al., 1993). 

The HE gene has been cloned and sequenced for four BCV strains (Quebec: 
Parker et al., 1989a; Mebus: Kienzle et al., 1990; L9 and LY138: Zhang et al., 
1991b) and one HCV strain (OC43: Zhang et al., 1992). In these viruses the gene 
is highly conserved and shows none of the variability in structure observed for 
MHV. The functional HE gene ranges in size from 1275 to 1320 nt. Chapter 8 will 
present a more detailed analysis of the function and structure of the HE protein. 
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B. ORFs between the S and sM Genes: Region II 


There is great variability between coronaviruses in the area of the genome 
between the S and the sM genes, in the size and in the number of ORFs present 
(Fig. 3). In IBV, two ORFs (3a and 3b) are present, each of which is expressed from 
mRNA 3. IBV mRNA 3 also encodes the sM protein (see Section ILC.). The 
corresponding proteins (3a: 6700 molecular weight; 3b: 7400 molecular weight} 
have been detected in IBV-infected cells, and both ORFs can be translated from 
T7/SP6 transcripts of cloned cDNA of mRNA 3 (Smith et al 1990; Liu et al., 
1991). IBV mRNA 3 is thus functionally poly(trilcistronic. 

Smith et al. (1990) proposed that translation of the IBV 3a and 3b ORFs 
is governed by the "leaky scanning" model of Kozak (1987, 1989). In this model 
the strength of the initiation codon of a downstream reading frame on a poly- 
cistronic mRNA has to be higher than that of the ORF that is located upstream. 
Skipping of the preceding, weaker AUGs by a portion of the ribosomes enables 
production of the downstream encoded proteins. Consistent with this model, 
the AUG codon of ORF 3a is in a weak context and no additional AUG codons 
are present in these ORFs. Note that ORFs 3a and 3b are in the region of 
mRNA 3 considered to fold into the RLP for translation of the sM ORF (see 
Section DLC 

In MHV, 2 or 3 ORFs are located between genes S and sM. In MHV-JHM, a 
single ORF 4 is translated from mRNA 4 and encodes a 15,000 molecular weight 
protein, which has been detected in infected cells (Ebner et al., 1988). MHV-A59 
contains two ORFs in this region: 4a (predicting a protein of 2200 molecular 
weight) and 4b (predicting a protein of 11,700 molecular weight; Weiss et al., 
1993). Alignment of the sequences of MHV-JHM and A59 from this area indi- 
cates that two single nt insertions/deletions have caused the difference. The 
short ORF 4a of MHV A59 is in a different reading frame and ORF 4b is the 
shorter homologue of the MHV-JHM ORF 4. In vitro translation of T7- 
transcribed RNA containing ORF 4a and 4b produced only a 2.2-kDa protein. 
Also, antibodies against the 15-kDa ORF 4 product of MHV-JHM did not detect 
any 11.7-kDa protein in MHV-A59-infected cells (Weiss et al., 1993). Thus, early 
assumptions that the 14-kDa protein observed in analyses of MHV-A59 encoded 
proteins (Rottier et al., 1981) and cell-free lysates (Leibowitz et al., 1982) is a 
product of the coding region of mRNA 4 may be incorrect. 

The sM ORF of MHV mRNA 5 (ORF 5b] is expressed preferentially to ORF 
5a in in vitro translation (Budzilowicz and Weiss, 1987). Since the product of 
ORF 5a has not yet been identified, it remains to be seen whether this mRNA is, 
like mRNA 3 of IBV, functionally polycistronic. If so, it would be the only corona- 
virus mRNA that is functionally monocistronic from a downstream ORF. 

Nothing is known about the function of the ORF 4 (a,b) and ORF 5a gene 
products. However, in MHV strain S, which is easily propagated in tissue 
culture, ORF 5a is deleted and ORF 4 is not expressed because synthesis of 
mRNA 4 has been eliminated by mutations in its promoter. It thus appears that 
these proteins are not required for replication, at least not in vitro and for this 
strain of MHV (Yokomori and Lai, 1991]. 

In BCV, the region between the S and sM genes contains 3 ORFs (Abraham 


44 WILLEM LUYTJES 


Purdue 
FS772/70 


. Miller/PP3 
TGEV: 





FIGURE 3. Genetic structure of coronavirus RNA in region II between the S and sM genes. The 
three structural clusters are outlined in separate panels. The numbers in the upper corners of the 
left panels represent the approximate lengths of the region in the respective clusters. Other legends 
are as in Figs. 1 and 2. 
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et al., 1990). The first two of these (ORFs 4a and 4b) are located in the coding 
region of mRNA 4, are homologous to MHV-JHM ORF 4, and may have arisen 
from a single base deletion in an ancestral BCV ORF 4. ORF 4a encodes a protein 
of 4900 molecular weight and can be translated in vitro from mRNA 4. The 
putative product of ORF 4b would be a protein of 4800 molecular weight, but it 
has not been detected after in vitro translation (Abraham et al., 1990). Unlike 
MHV ORFs 4/4a, BCV ORF 4a overlaps briefly with the end of the S gene. 
Accordingly, the promoter responsible for the generation of mRNA 4 is located 
some 330 nt inside the upstream gene. The third BCV ORF in this region is 
located in the coding region of mRNA 5, which has its promoter located 
internally in ORF 4b. ORF 5 encodes a protein of 12,700 molecular weight 
(Abraham et al., 1990; Woloszyn et al., 1990]. 

The HCV-OC43 strain reported by Mounir and Talbot (1993) also shows 
considerable variation in this area compared to BCV, although otherwise both 
viruses are highly homologous at the nucleotide and amino acid levels. The 
difference is due to a large deletion in the mRNA 4 coding region of HCV-OC43. 
This deletion has left only 11 amino acids from ORF 4a. The authors found that 
this truncated ORF is followed by a copy of a large part of the HCV-OC43 leader 
sequence. This finding is possibly an artifact, but it is not unique because 
Taguchi et al. (1994) have found a similar arrangement between the M and N 
genes of MHV strain S. The partial leader copy is succeeded by the homologue of 
the BCV ORF 5, although for HCV-OC43 it should be designated ORF 4b 
because it is located in the coding region of mRNA 4. The deletion event also 
removed the promoter that would give rise to the OC43 version of BCV mRNA 
5. Whether this configuration occurs in other OC43 variants remains to be 
determined. 

In the coronaviruses of the TGEV cluster, the region between the S and sM 
genes is extremely variable. Generally, two ORFs are present. In two TGEV 
strains, FS772/70 and Purdue-115, the ORFs are located on mRNA 3 (Britton et 
al., 1989; Rasschaert et al., 1990). Therefore, the ORFs have been designated 
ORF 3a and 3b. However, Wesley et al. (1989) reported for the Miller/PP3 strain 
of TGEV that these ORFs were each translated from a separate RNA, designated 
mRNAs 3 and 4, respectively. Consequently, these ORFs were named ORE 3 
and 4. Following the proposed coronavirus nomenclature and to preserve anal- 
ogy with other coronaviruses, these mRNAs are called mRNAs 3 and 3-1, 
encoding ORFs 3a and 3b. 

In TGEV-related strains porcine respiratory coronavirus (PRCV) 86/137004 
and 86/135308, the promoter responsible for the synthesis of mRNA 3 has been 
deleted. A new one has been generated by mutations approximately 300 nt 
downstream, just in front of the AUG of ORF 3b (Page et al., 1991). Thus, PRCV 
synthesizes a different mRNA 3 (related to mRNA 3-1 of TGEV Miller/PP3], 
containing ORF 3. To make the picture even more complicated, a TGEV Miller/ 
PP3 variant, SP, contains a large deletion stretching from just upstream of the 
promoter for mRNA 3 into ORF 3b and this virus produces no mRNA 3 at all 
(Wesley et al., 1990). 

Coronavirus CCV is closely related to TGEV strains FS772/70 and Purdue. 
As in these strains, two CCV ORFs are present, 3a and 3b, both in the coding 
region of mRNA3. While the nt sequences are highly similar to TGEV, ORF 3b 
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is truncated by 20% compared to TGEV 3b, because of an acquired termination 
codon (Horsburgh et al., 1992). 

In FIPV, the counterparts for the TGEV ORFs 3a and b are reportedly 
present, but mutations have split ORF 3b into two separate ORFs and an extra 
ORE, starting in 3a, can be observed, bringing the number of FIPV ORFs in this 
region to 4 (Vennema, 1991). These four ORFs are apparently located in the 
coding region of mRNA 3. There are no data on expression of these ORFs. 

Two other members of the TGEV cluster, PEDV and HCV-229E, are some- 
what divergent in region II, since both seem to lack a homologue of ORF 3a. In 
PEDV, one ORF is present between the S and sM ORFs (Duarte et al 1994). This 
ORF 3 is related to ORF 3b in the TGEV strains and is located in the coding 
region of mRNA 3. ORF 3 appears to be polymorphic, as sequence variation was 
observed between independent clones and between two variants, which in- 
cluded deletions leading to truncation of the ORF. HCV-229E contains two 
ORFs in this area, which together are homologous to PEDV ORE 3. They briefly 
overlap and are located in the coding region of a single mRNA (Raabe and 
Siddell, 1989; Raabe et al 1990). The nomenclature for the HCV-229E mRNAs 
used by the authors is different from that for the other members of this cluster. 
They have found an extra RNA with a coding region starting internally in the S 
gene, not only by hybridization but also by metabolic labeling (Schreiber et al., 
1989). Therefore, the next smaller mRNA is numbered 4, rather than 3. Conse- 
quently, the ORFs located between the S and sM genes are named 4a and 4b. For 
the sake of clarity, it may be better to number these ORFs 3a and 3b, on the 
coding region of mRNA 3. The upstream promoter in S would then produce 
RNA 2-1. There is no information on the expression and biological relevance of 
these ORFs. 


C. ORFs between the M and N Genes: Region III 


In IBV, extra ORFs can be found between the M and the N genes (Fig. 4). 
Two ORFs, 5a and 5b, encoding proteins of 7400 and 9500 molecular weight, 
respectively, are present. The ORFs are both expressed from mRNA 5 and can 
be detected in infected cells, implying that the IBV mRNA 5 is also functionally 
polycistronic (Liu and Inglis, 19922). The “leaky scanning" model does not seem 
to apply here since the AUG codon of ORF 5a is in a stronger context than that 
of ORF 5b. The translation mechanism of ORF 5b therefore remains to be 
determined. 


FIGURE 4. Genetic structure of coronavirus 
RNA in region III between the M and N 
genes. Legends are as in Figs. 1 and 2. 
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FIGURE 5. Genetic structure of co- 
ronavirus RNA in region IV down- 
stream of the N gene. Legends are as 
in Figs. 1 and 2. 





D. ORFs Downstream of the N Gene: Region IV 


Unlike viruses from the MHV and IBV clusters, coronaviruses from the 
TGEV cluster encode proteins downstream of the N gene (Fig. 5). As was deter- 
mined by De Groot et al. (1988), two ORFs are present in this region of the FIPV 
genome: ORF 7a and 7b, located on mRNA 7, which is generated from a pro- 
moter immediately downstream of the N gene. In TGEV, only one ORF is found 
(ORF 7), apparently a deletion product of the homologue of ORF 7a of FIPV 
(Kapke and Brian, 1986; Rasschaert et al., 1987; De Groot et al., 1988; Britton et 
al., 1988). Coronaviruses CCV and FECV are similar to FIPV. The CCV ORFs are 
colinear with those of FIPV but only 60-80% homologous at the amino acid 
level. In contrast, the FECV ORFs are 89-9995 homologous, but ORF 7b has 
been truncated by a point mutation (Vennema et al., 1992a; Horsburgh et al., 
1992). The 9100-molecular-weight translation product of ORF 7 of TGEV 
(Garwes et al., 1989; Tung et al., 1992) and a 14,000-molecular-weight product of 
ORF 7b of FIPV, CCV, and FECV (Vennema, 1991) have been detected in vivo by 
immunoprecipitation. The ORF 7 protein of TGEV is endoplasmic reticulum 
(ER) membrane associated (Garwes et al., 1989; Tung et al., 1992), leading to the 
suggestion that it may play a role during virus assembly or budding. The ORF 7b 
protein of FIPV is a soluble protein containing the amino acid sequence KTEL, 
an ER-retention signal (Vennema et al., 1992b]. This suggests that it can be 
retrieved into the ER lumen after transport. Its role in virus replication is not 
yet clear. If a product of ORF 7a can be detected in FIPV-infected cells, the FIPV 
mRNA 7 will be another example of a coronavirus mRNA that is functionally 
polycistronic. The separate position held by HCV-229E and PEDV in the TGEV 
cluster is again illustrated by the fact that these viruses do not contain any 
ORFs or promoters in this region (Schreiber er al., 1989; Bridgen et al., 1993). 


IV. CONCLUSION 


The genomes of coronaviruses apparently contain a relatively flexible set of 
conserved reading frames that are essential and variable or nonconserved read- 


48 WILLEM LUYT]JES 


ing frames that often appear to be dispensable in vitro. It should be noted, 
however, that many viruses described in the literature are tissue culture vari- 
ants, or at least adapted to growth in tissue culture. They may not be infectious 
to the natural host in their mutated form and dispensable ORFs may indeed 
prove essential to the virus in the field. 

Nevertheless, the data do emphasize the adaptability of coronaviruses and 
the flexibility of the genome. The implications for evolution are apparent. The 
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FIGURE 6. Summary of translation strategies used by coronaviruses. Diagrams of coronavirus 
RNAs are represented with a cap (C), the leader (L), promoters (>) bordering the coding region of the 
RNA, its functional ORF in white, and nontranslated ORFs in gray. The nontranslated ORFs are 
generally downstream of the second promoter on the RNA and therefore not in the coding region 
(a-1). In one case (MHV-A59 mRNA 4], a nontranslated ORF occurs inside the coding region (a-2] and 
in another case (MHV-A59 mRNA 5] only the downstream ORF is translated (a-3). In b through e, 
only the coding region is shown and it is indicated how the additional downstream ORFs in coding 
regions are translated. See the text for additional information. (a) The ribosomal 40S subunit 
associates with the mRNA cap via cap binding and initiation factors, scans the RNA, finds the 
initiating AUG codon, whereupon the 60S subunit associates and translation follows. After reach- 
ing the termination codon, the complex dissociates and no further downstream ORFs are trans- 
lated. (b,c) A second downstream ORF (b) or internal ORF (c) is translated by leaky scanning. Thus, a 
portion of the ribosomal 40S subunits fail to recognize the AUG codon of the first ORF and 
initiation at the AUG codon of the downstream ORF occurs. (d) A large, downstream ORF is 
translated by (—1) ribosomal frameshifting, (mediated by a slippery sequence and RNA-pseudoknot) 
into this ORF. (e) A ribosomal landing pad, which contains functional ORFs, serves as an internal 
ribosomal entry site to allow translation of the downstream ORF indicated. In the right panel 
examples of the ORFs for which the particular translation strategy is used, oris likely to be used, are 
given. 
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potential of coronaviruses to evolve rapidly, by their ability to incorporate new 
genes, is most convincingly illustrated for the HE gene. For coronaviruses of the 
MHV cluster, the biological properties acquired with the HE gene are not fully 
understood, but the available data indicate that HE is involved in pathogenesis 
and the acquisition of the gene constitutes a significant evolutionary advantage 
to the viruses involved. The mechanism by which new genes are incorporated 
into the genome is very likely to be RNA recombination between heterologous 
RNAs. Obviously this will be a major subject of study on coronaviruses in the 
coming years. | | 

Viruses have often developed alternative strategies for the expression of 
proteins essential for replication. Translational mechanisms different to the 
strategy of ribosomal scanning are also believed to allow for the regulation of 
protein levels. It now appears that coronaviruses employ a whole set of alterna- 
tive translation strategies, including ribosomal frameshifting, internal ribo- 
somal entry, and leaky scanning (Fig. 6). It is likely that ribosomal frameshifting 
in the POL gene provides a crucial means of regulating expression of the vital 1b 
ORF functions and is used by all coronaviruses. More puzzling is the expression 
of the sM ORF. Closely related coronaviruses use different strategies to express 
this ORF, which may reflect a fundamental difference between these viruses. It 
will be interesting to find out what these differences are. In particular, it will be 
especially important to find out whether the IRES elements of IBV and MHV 
function on genomic RNA, which would imply that the sM ORF in these 
viruses can be translated from the incoming genome. 

Detailed analyses of coronavirus translation mechanisms, replication 
strategies, and the relative importance of the different ORFs, especially those of 
the polymerase coding region, are in progress. The analysis of the biological 
functions of these ORFs awaits the use of site-directed mutagenesis and the 
techniques of reverse genetics, which, hopefully, may soon become available to 
coronavirologists. 
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CHAPTER 4 


Coronavirus Receptors 


KATHRYN V. HoLMES AND SUSAN R. COMPTON 


I. CORONAVIRUS SPECIES AND TISSUE TROPISMS 


Coronaviruses are highly species-specific in that they generally cause disease in 
only one host species (Móstl, 1990; Wege et al., 1982). However, experimental 
inoculation of other species with several coronaviruses, either by artificial 
routes such as intracerebral inoculation or in the highly susceptible neonatal 
period, can result in mild or asymptomatic infection as shown in Table I. In 
general, coronaviruses only infect cells from their normal host species or from 
species that are susceptible to infection with an antigenically related coro- 
navirus (Table II). Host-dependent differences in susceptibility to coronavirus 
infection can be demonstrated within a species. For example, different strains of 
inbred mice vary greatly in their susceptibility to infection with various murine 
hepatitis virus (MHV) strains (Bang and Warwick, 1960; Stohlman et al., 1980; 
Wege et al., 1982). 

Coronaviruses exhibit strong tissue tropisms in vivo. While coronavirus 
infections are usually initiated in the respiratory and/or enteric epithelium and 
some coronavirus infections are limited to these tissues, several coronaviruses 
can spread to specific other organs such as the liver, lymphoid organs, brain, 
peritoneum, or kidney. Different isolates of the same or closely related coro- 
naviruses may exhibit differing degrees of tropism for enteric or respiratory 
epithelium. For example, more than 20 isolates of MHV are divided into two 
biotypes based on their initial site of replication in either the respiratory or 
enteric tract (Barthold, 1986). 

In this chapter, we will discuss the role of virus receptors as determinants 
of the species specificity, host strain specificity, and tissue tropism of corona- 
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TABLE I. Infection of Animals with Coronavirus 


Coronavirus Mouse Rat Cow Turkey Human Pig Dog Cat References 


MHV d, Da bb a 

SDAV d A d, D b,c 

BCV d b d, A ac A b, d, e, f, g 
TCV d, D 

HEV b d, A h 
HCV-OC43 d d, D b,i 
HCV-229E d, D A j 

TGEV d, D a A k, 1, m 
CCV aA dA aA nopqr 
FIPV d, A dD ns 


aBoldface indicates infection of natural host; nonboldface indicates infection of a foreign species. 

bLower case letters indicate neonatal animal; upper case letters indicate weanling/adult animal. 

ca or A indicates asymptomatic infection following natural route of exposure; b or B indicates disease following 
intracerebral inoculation only; d or D indicates disease following natural route of exposure. 

References: a, Cheever et al., 1949; b, Barthold et al., 1990; c, Bhatt et al., 1977; d, Kaye et al., 1975; e, Akashi et al., 
1981; f, Dea et al., 1991; g, Storz and Rott, 1981; h, Hirahara et al., 1992; i, McIntosh et al., 1967; j, Barlough et al., 
1985; k, Larson et al., 1979; 1, McClurkin et al., 1970; m, Reynolds et al., 1979; n, Woods et al., 1981; o, Woods and 
Wesley, 1986; p, Barlough et al., 1984; q, Stoddart et al., 1988; r, McArdle et al., 1992; s, Woods and Pederson, 1979. 


virus infection. It is important to note, however, that susceptibility to corona- 
virus infection and disease depends on many other host-dependent factors in 
addition to receptor availability and specificity, including intracellular deter- 
minants of virus replication and immunological responses to virus infection. 


I. CORONAVIRUS ENVELOPE GLYCOPROTEINS THAT 
INTERACT WITH RECEPTORS 


Coronavirus envelopes may exhibit either one or two envelope glycoproteins 
that interact with different cellular receptors. The virus attachment proteins 
are the spike glycoprotein (S), which forms the large peplomers characteristic of 
coronaviruses, and the hemagglutinin esterase (HE) glycoprotein, which forms 
short spikes in some coronaviruses in the MHV/OC43/BCV (bovine corona- 
virus) serogroup (Spaan et al., 1988; Sturman and Holmes, 1985, Vlasak et al., 
1988a,b). Aspects of the S and HE glycoproteins that may be important for 
receptor interactions are summarized briefly here. These envelope glycopro- 
teins will be discussed in detail in other chapters of this volume. 

The S glycoprotein forms a trimeric spike that is responsible for virus 
attachment to specific receptor glycoproteins, mediates virus-induced mem- 
brane fusion, and induces neutralizing antibody and cell-mediated immune 
responses (Daniel et al., 1993; Delmas and Laude, 1990; Rasschaert et al., 1987). 
The S glycoprotein of BCV, like its HE glycoprotein, can bind to 9-O-acetylated 
sialic acid residues on host cell macromolecules (Schultze et al., 1991a). Mono- 
clonal antibodies to S inhibit virus infection and/or virus-induced cell fusion, 
and virions that lack S protein are not infectious (Daniel et al., 1993; Holmes 
et al., 1981; Rasschaert et al., 1987). Sequencing of the genes encoding the S 
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glycoproteins of many coronaviruses indicates that there is a high degree of 
variability between different viruses and between different strains of the same 
virus (Cavanagh et al., 1988). Newly synthesized S proteins of many corona- 
viruses can be cleaved at a cluster of basic amino acids near the center of the 
protein by trypsin or related host proteases associated with the Golgi apparatus 
to yield two subunits called S1 and S2 (Lai, 1990; Spaan et al., 1988). S1, the 
amino-terminal subunit located on the tip of the spike, exhibits a high degree of 
variability. S2, the carboxy-terminal subunit, is predicted to include the stalk of 
the spike, the transmembrane, and intracytoplasmic domains. The extent of S 
cleavage depends on the type of coronavirus, the virus strain, and the host cell 
used to grow the virus. For several coronaviruses including BCV and rat corona- 
virus (RCV), addition of trypsin to the culture medium increases cell-fusing 
activity, infectivity, and/or plaquing ability (Gaertner et al., 1991; Storz et al., 
1981). Mutations at the cleavage site of the S glycoprotein of MHV are associ- 
ated with reduced virulence, delayed cell fusion, and virus persistence (Gal- 
lagher et al., 1991; Kant et al 1992). Large insertions or deletions that occur near 
specific sites in S1 of some coronaviruses have been associated with altered 
receptor interactions, different tissue tropism, and altered virulence (Gallagher 
et al., 1990; La Monica et al., 1991; Wang et al., 1992). For example, two closely 
related porcine coronavirus strains, transmissible gastroenteritis virus |(TGEV] 
and porcine respiratory coronavirus (PRCV), that differ markedly in the lengths 
of their S1 glycoproteins cause either enteric or respiratory disease, respectively 
(Laude et al 1993). 

The HE glycoprotein found on some coronavirus envelopes is a 120 to 140- 
kDa disulfide-linked dimer of 60 to 65-kDa monomers that forms a fringe of 
short spikes (Herrler et al., 1991; Vlasak et al., 1988a). HE is encoded in the 
genomes of coronaviruses in the serogroup that includes MHV, BCV, OC43, 
hemagglutinating encephalitis virus (HEV), and turkey coronavirus (TCV), but 
not in the genomes of the avian infectious bronchitis virus (IBV) or HCV229E 
groups of coronaviruses (Dea et al., 1989; Hogue and Brian, 1986; Schultze et al., 
1990; Vautherot et al., 1992). Whether or not full-length HE protein is expressed 
in cells infected with a particular virus strain is determined by the availability 
of an mRNA with a 5'-terminal HE gene and by initiation and termination 
codons in the gene. Thus, HE protein is found on envelopes of some MHV 
strains but not others (Yokomori et al., 1991). Coronavirus HE is closely related 
in amino acid sequence to the HE glycoprotein of influenza C (Herrler et al., 
1991; Luytjes et al., 1988). These HE proteins bind to 9-O-acetylated sialic acid 
receptor moieties, Neu5,9Ac,, that are found on glycoproteins, glycolipids, and 
gangliosides on membranes of erythrocytes, enterocytes, brain, and other cell 
types in vivo and on cell lines from several species (Herrler et al., 1987, 1991; 
Varki, 1992, 1993). Acetylation at the 9 position of sialic acid is developmentally 
regulated in various cell types of different species. The acetyl-esterase of HE 
releases the acetyl groups from the neuraminic acid, destroying virus-binding 
activity (Schultze et al., 1991b). Treatment of virions with pronase selectively 
removes HE and inhibits hemagglutination (Dea et al., 1989; Hogue and Brian, 
1986; King et al., 1985). Monoclonal antibodies to the HE glycoproteins can 
inhibit hemagglutination, neutralize virus infectivity, and inhibit esterase ac- 
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tivity (Parker et al., 1989, 1990; Storz et al., 1991; Vautherot et al., 1992; Yoo 
et al., 1992). 


III. CORONAVIRUS BINDING AND PENETRATION 


Coronavirus envelope glycoproteins may interact with specific cell mem- 
brane receptors at several steps during the virus replicative cycle. To initiate 
infection, virions bind to cell membrane receptors by either the S or HE glyco- 
protein, or possibly by sequential binding of HE followed by binding of S. Next, 
conformational changes in either the viral attachment glycoprotein, S, or the 
receptor, or both may be induced by virus binding and lead to fusion of the viral 
envelope with host cell membranes. This membrane fusion event has similar 
properties to the cell-cell fusion that occurs in cell cultures or tissues of 
animals infected with some coronaviruses (Barthold, 1986). MHV-A59-induced 
fusion is optimal at neutral or mildly alkaline pH (Sturman et al., 1990], like 
that induced by Sendai and HIV, which fuse at the plasma membrane (White, 
1990). For other coronaviruses, such as HCV-229E, IBV, or some MHV variants, 
infected cells generally do not fuse at neutral pH, and the viral envelope may 
fuse with endosomal membranes rather than the plasma membrane (Gallagher 
et al., 1991}. After synthesis of viral genomic RNA and proteins, virions mature 
by budding at intracellular membranes in a special pre-Golgi compartment and 
are transported through the Golgi apparatus into post-Golgi vesicles that prob- 
ably fuse with the plasma membrane to release virions from infected cells (Grif- 
fiths and Rottier, 1992; Tooze and Tooze, 1985). Newly synthesized cellular 
glycoproteins that serve as virus receptors are probably also present in the endo- 
plasmic reticulum and Golgi membranes, so that interactions of envelope glyco- 
proteins with nascent receptors could occur in these compartments. Corona- 
virus virions tend to adhere to the plasma membrane of the host cell following 
release, forming dense mats of adsorbed virions (Oshiro, 1973), which suggests 
that elution of virions from receptors is inefficient. The viral spike glycopro- 
tein, S, which is not incorporated into virions at intracellular membranes, is 
transported to the plasma membrane where it can bind to specific virus recep- 
tors on adjacent uninfected cells and initiate cell-cell fusion. 

Attachment and penetration are complex processes that have not yet been 
studied in great detail for most coronaviruses. Initial binding of a single HE or S 
glycoprotein to a receptor may be quickly followed by the recruitment of 
additional receptors beneath the virion. Cooperative interactions of many enve- 
lope glycoproteins with receptors could prolong adherence of the virion to the 
plasma membrane. Subsequent fusion of the viral envelope with the plasma 
membrane or endosomal membrane may depend on the affinity of S for its 
receptor, the fusing activity of the S glycoprotein, and the lipid composition of 
the cell membrane. The early events in coronavirus-cell interactions are tech- 
nically difficult to study due to the high ratio of noninfectious to infectious 
virions in coronavirus preparations and the tendency of S1 to detach from the 
virions (Boursnell et al., 1989; Cavanagh et al., 1986; Sturman et al., 1990). 
Studies on the interactions of purified virus attachment proteins with purified 
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recombinant receptor proteins will provide additional useful information about 
receptor affinities. 


IV. CARCINOEMBRYONIC ANTIGEN-RELATED RECEPTORS 
FOR MHV 


As shown in Table I, MHV naturally infects only mice, although young rats 
can be infected following intracerebral inoculation. Inbred mouse strains show 
varying degrees of susceptibility to different MHV strains. Princeton mice (PRI) 
are fully susceptible and die from MHV-2 infection (Bang and Warwick, 1960). 
Similarly, BALB/c, C57B1/6, and DBA mice are fully susceptible to fatal hepa- 
titis from MHV-3 infection (Le Prevost et al., 1975). In contrast, C3H mice are 
semiresistant to MHV-2, which produces lower virus titers and only mild or 
inapparent disease, and to MHV-3, which gives adult C3H mice chronic neuro- 
logical disease but not hepatitis (Virelizier et al., 1975). A/J mice, which are also 
semiresistant to MHV-3, do not develop disease, although virus titers in the 
liver are comparable to those of BALB/c mice (Dindzans et al., 1986; Le Prevost 
et al., 1975). Of all inbred mouse strains tested, only SJL mice are resistant to 
MHV-JHM-induced encephalomyelitis, and peritoneal macrophages from SJL 
but not other mouse strains are resistant to infection with MHV-JHM and 
MHV-A59 (Knobler et al., 1984; Stohlman and Frelinger, 1978). 

To determine whether receptors play a role in the mouse strain specificity 
and tissue tropism of MHV infections, Boyle et al., (1987) studied binding of 
MHV virions to membranes from murine enterocytes or hepatocytes, which are 
the natural target tissues of MHV. In a solid-phase assay, MHV-A59 bound to 
membranes from MHV-susceptible BALB/c mice, but not to membranes from 
MHV-resistant SJL mice. This assay also showed that binding of MHV-A59 to 
intestinal brush border membranes is highly species specific, and no virus 
binding could be detected on membranes from the rat, cat, dog, pig, human, 
rabbit, or chicken (Compton et al., 1992). A virus-overlay protein blot assay 
showed that in BALB/c liver and intestine only one 110 to 120-kDa BALB/c 
membrane glycoprotein, called the mouse hepatitis virus receptor (MHVR], 
bound MHV-A59 virions (Boyle et al., 1987). Virus binding to proteins from SJL 
membranes was not detected. An anti-receptor monoclonal antibody, MAb- 
CCl, was generated by immunization of SJL mice with BALB/c intestinal brush 
border membranes. MAb-CCI bound to the 100 to 120-kDa MHVR and to a 55 to 
58-kDa glycoprotein in immunoblots of BALB/c liver and intestine membranes, 
blocked binding of MHV to cultured BALB/c or C3H fibroblasts, and protected 
them from infection with MHV-A59 and other strains of MHV (Dveksler et al., 
1991, 1993a; Williams et al., 1990). Binding of radiolabeled MAb-CCI to mem- 
brane preparations from different mouse tissues showed that the highest levels 
of MHVR protein were on colon, small intestine, and liver (Williams et al., 
1991). MHVR was not detected on other BALB/c tissues or on any SJL/J tissues 
by this MAb-CCI binding assay. Thus, the small intestine and liver, which are 
major target tissues for MHV infection in vivo, express the highest levels of 
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MHVR. The mouse strain-dependent expression of MHVR also correlates with 
the susceptibility of different mouse strains to MHV infection. 

Immunoaffinity chromatography of Swiss-Webster mouse liver using MAb- 
CCl yielded a mixture of a 100- to 120-kDa and a 55- to 58-kDa glycoprotein 
(Williams et al., 1990). Comparison of the N-terminal amino acid sequence of 
the 100- to 120-kDa glycoprotein, called MHVR, with known sequences re- 
vealed strong homology with members of the carcinoembryonic antigen [CEA] 
family of glycoproteins in the immunoglobulin superfamily (Williams et al., 
1991). The sequence of a full-length cDNA clone encoding MHVR predicted a 
mature MHVR protein of 424 amino acids with 4 immunoglobulinlike domains, 
a transmembrane domain, and a short intracytoplasmic domain (Dveksler et 
al., 1991). The protein is highly glycosylated, with 16 predicted N-linked glyco- 
sylation sites. The coding sequence of MHVR is identical to that of mmCGMI 
(BgpA), a murine CEA-related glycoprotein in the biliary glycoprotein (BGP) 
subgroup (Beauchemin et al., 1989a). Expression of MHVR in a vaccinia virus 
vector yielded a recombinant glycoprotein that binds MHV and MAb-CCI. 
Human cells and hamster cells are resistant to infection with MHV-A59 vir- 
ions, but when transiently or stably transfected with cDNA encoding MHVR, 
they become susceptible to MHV-A59 and all other strains of MHV tested. 
Infection of these MHVR-transfected cells can be blocked by pretreatment of 
the cells with MAb-CCI (Dveksler et al 1991, 1993a). These data show conclu- 
sively that MHVR is a functional receptor for MHV and suggest that the block 
in MHV replication in cells of non-murine species is in receptor-dependent 
virus attachment and entry. 

On the MHVR glycoprotein, the binding sites for the S glycoprotein of 
MHV and for MAb-CC1 are located on the N-terminal Ig-like domain (Dveksler 
et al., 1993b). Several other members of the immunoglobulin superfamily of 
glycoproteins serve as receptors for viruses, including ICAM-1 for human rhino- 
viruses, CD4 for HIV, and poliovirus receptor for polioviruses (White and Litt- 
man, 1989]. For each of these receptors, as for MHVR, the virus binds to 
determinants on the N-terminal immunoglobulinlike domain. 

In addition to MHVR, liver and enterocyte membranes from BALB/c mice 
and other MHV-susceptible strains contain a 55- to 58-kDa glycoprotein that is 
recognized by MAb-CCI (Williams et al., 1990). From these tissues, cDNAs 
derived from several alternatively spliced transcripts of MHVR were cloned and 
sequenced. The 55 to 58-kDa glycoprotein is encoded by one of these transcripts 
which includes only the N-terminal and fourth Ig-like domain and trans- 
membrane and cytoplasmic domains (Dveksler et al., 1993a). The second and 
third Ig-like domains were deleted by splicing. Therefore, this protein was 
called MHVR(2d) to distinguish it from the original receptor, which is now 
called MHVR|4d] (Dveksler et al., 1993a). Additional splice variants generated 2 
and4 domain proteins with longer cytoplasmic domains. When transfected into 
hamster fibroblasts, each of these new MHVR-related cDNAs encodes an 
MHVR glycoprotein isoform that is recognized by MAb-CCI and that makes 
the cells susceptible to infection with MHV-A59 and other strains of MHV, as 
evaluated by development of viral antigens in the cytoplasm of cells, or by 
synthesis and release of infectious virions (Dveksler et al., 1993a; Yokomori and 
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Lai, 1992a,b). Thus, in MHV-susceptible strains of mice there are at least four 
alternatively spliced transcripts of the Bgp gene that encode functional MHV 
receptors. 

MHV-resistant adult SJL/J mice were studied to determine whether they 
express proteins homologous to MHVR. A polyclonal rabbit antibody directed 
against the 15 amino acids at the amino-terminus of MHVR and a polyclonal 
antibody directed against human CEA both recognized 55-kDa and 105- to 115- 
kDa glycoproteins in SJL/J intestinal brush border membranes (Williams et al., 
1990). In addition, reverse transcriptase polymerase chain reaction (RTPCR), 
Northern blot, and RNase protection assays were used to characterize cDNAs 
from SJL/J mice and outbred CD-1 mice that encode glycoproteins homologous 
to MHVR (Dveksler et al., 1993a; Yokomori and Lai, 1992a,b). Sequencing of 
these cDNAs revealed major differences in the amino acid sequence of the 
N-terminal domain between MHVR and its SJL/J homologue, as well as minor 
differences in other domains. One of three potential N-linked glycosylation 
sites from the N-terminal Ig domain of MHVR was missing in the SJL homo- 
logue, which could result in the slightly smaller size of the resulting glycopro- 
teins. The coding sequence of the SJL homologue is identical to that of 
mmCGM2 derived from outbred CD-1 mice (Beauchemin et al., 1989b; Mc- 
Cuaig et al., 1993). Both 2 domain and 4 domain isoforms of mmCGM2 were 
identified, and a transcript encoding an mmCGM2 with a longer cytoplasmic 
domain was cloned. No MHVR transcripts or genes could be found in SJL 
tissues by RTPCR or RNase protection assays, which indicates that MHVR and 
mmCGM 2 are alleles of the same murine Ben gene (Dveksler et al., 1993a; 
Nedellec et al., 1993). None of the mmCGM2 isoforms was recognized by MAb- 
CCl, and none bound MHV-A59 virions in solid phase binding assays or virus 
overlay protein blot assays (Dveksler et al., 1993a; Williams et al., 1990]. There- 
fore, it was surprising that each of the recombinant mmCGM2 isoforms when 
expressed in hamster fibroblasts made the cells susceptible to infection with 
MHV-A59 (Dveksler et al., 1993a). Differences in the characteristics of the 
N-terminal Ig domains of the BALB/c and SJL isoforms could account for the 
observed differences in virus and MAb-CCI binding and receptor activity be- 
tween MHVR and mmCGM2 isoforms. Transfection of mmCGM2 cDNA into 
SJL cell lines made them susceptible to MHV-A59 infection (Dveksler et al., 
1993a), suggesting that the block in coronavirus replication in these cells is at 
an early step in virus attachment or entry. The Bgp glycoproteins expressed in 
adult SJL mice may have weaker MHV-binding activity than the MHVR glyco- 
protein, and overexpression of a recombinant SJL protein may at least partially 
compensate for its weak binding activity. Direct measurement of affinity be- 
tween MHV spike glycoproteins and receptor isoforms will be required to test 
this hypothesis. 

Several cellular functions of the Bgp protein isoforms of mice and rats have 
been identified. These proteins may bind calmodulin and can serve as calcium- 
independent or calcium-dependent homophilic cell adhesion molecules (Culic 
et al., 1992; Turbide et al., 1991). The rat Bgp protein has ecto-ATPase activity 
(Lin and Guidotti, 1989}, and Bgp proteins act as bile acid transporters in the 
liver (Sippel et al., 1993). The expression of Bgps and other CEA-related glyco- 
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proteins is regulated in a tissue-dependent manner during development, and is 
often elevated in transformed cells (Beauchemin et al., 1987). Consequently, a 
single cell type from an outbred CD-1 mouse or an SJL X BALB/c hybrid mouse 
may express several isoforms of MHVR as well as several isoforms of mmCGM2. 
How the expression of each of these related glycoproteins is regulated, how the 
proteins are targeted, and how they interact with each other or with other cell 
membrane molecules are not yet known. Immunolabeling of BALB/c tissues 
with MAb-CCI shows that large amounts of MHVR antigen are expressed in 
BALB/c mice on the intestinal brush borders membranes, respiratory epithe- 
lium, bile canaliculi, endothelial cells, and proximal tubules of the kidney 
(Godfraind, in preparation]. MHVR has also been detected by fluorescence 
activated cell sorting (FACS) analysis on B cells, macrophages, and fibroblast 
cell lines (Coutelier et a1., 1994). Thus, large amounts of receptor glycoproteins 
are expressed on the membranes of the respiratory and enteric tracts where 
virus enters the body and on hepatocytes which are a major target tissue for 
MHV infection, and receptor is also expressed on macrophages and endothelial 
cells which could play a role in spread of infection by viremia. It is not yet clear 
whether the expression of different levels of receptor isoforms in different 
tissues affect the tissue tropisms of diverse MHV strains. 


V. AMINOPEPTIDASE N RECEPTORS FOR TGEV AND HCV- 229E 


Coronaviruses serologically related to HCV-229E include canine corona- 
virus (CCV), feline infectious peritonitis virus (FIPV) and feline enteric corona- 
virus (FECV], and TGEV and PRCV. Unlike MHV, which binds only to intesti- 
nal brush border membranes from MHV-susceptible strains of mice, HCV-229E 
and serologically related coronaviruses bind to receptors on brush border mem- 
branes of several unrelated species (Table III) (Compton, 1988; Compton et al., 
1992). The species specificity of infection with these viruses appears to be 
determined by a step subsequent to virus binding. 

The strategy used to identify cellular receptors for members of the HCV- 
229E group of coronaviruses was similar to that used to identify the MHV recep- 
tors. Monoclonal antibodies directed against membranes of virus-susceptible 
cells were prepared and tested for the ability to block virus infection, and the 
protein precipitated by that antibody was identified, cloned, and expressed in 
foreign species to be tested for virus receptor activity (Delmas et al., 1992; 
Yeager et al., 1992]. Several MAbs directed against porcine cell membrane 
proteins were found to prevent infection with TGEV (Delmas et al., 1992). 
These antibodies immunoprecipitated a 130- to 150-kDa membrane glycopro- 
tein. N-terminal amino acid sequencing of the immunoprecipitated protein 
identified it as porcine aminopeptidase N (APN), a metalloprotease that is a 
class 2 glycoprotein (Ashmun and Look, 1990; Look et al., 1986). Direct binding 
of TGEV to purified porcine APN (pAPN) was demonstrated (Delmas et al., 
1992). When the cDNA encoding pAPN was cloned and expressed in TGEV- 
resistant canine cells, the cells became susceptible to TGEV infection (Delmas 
et al., 1992). Thus, pAPN is a functional receptor for TGEV. 
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TABLE III. Binding of Coronaviruses to Intestinal 
Brush Border Membranes of Different Species 


Source of brush border membranes 


Virus Mouse Human Dog Cat Pig 
MHV ++4 — — — — 
HCV-229E + ++ + + + 

CCV — FF ++ ++ ++ 
FIPV — ++ ++ ++ ++ 
TGEV — ++ ++ + ++ 


4Boldface indicates binding to membranes from normal host; non- 
boldface indicates binding to membranes from a foreign species. 
++, Strong binding; +, moderate binding; +, weak binding; —, no 
detectable binding. 


Similarly, a monoclonal antibody, MAb-RBS, directed against membrane 
glycoproteins on human cell lines was found to block HCV-229E infection of 
human WI-38 fibroblasts and HL-60 cells (Yeager et al 1992). This MAb also 
immunoprecipitates a 150-kDa glycoprotein that is recognized by MAbs di- 
rected against human APN (hAPN|, which is also called CD13 (Look et al., 
1989). Murine fibroblasts are normally resistant to infection by HCV-229E, but 
they become susceptible to the virus following transfection with a cDNA clone 
that encodes hAPN, and infection is blocked by MAb-RBS (Yeager et al 1992). 
Interestingly, although hAPN is a functional receptor for HCV-229E, it is not a 
good receptor for TGEV which is serologically related (Holmes, in preparation]. 
This observation suggests that other species-specific host factors may block 
critical steps in virus replication that follow virus binding to APN to determine 
the species specificity of infection with this group of coronaviruses. 

The amino acid sequences and domain structures of APN and Bgp are 
unrelated, although both cellular glycoproteins can serve as receptors for one or 
more coronaviruses. APN is a zinc-binding protein that is found on the plasma 
membrane as a homodimer with its N-terminal domain anchored in the cyto- 
plasm (Ashmun and Look, 1990). A short transmembrane region is followed by a 
stalk domain, and the catalytic site of the enzyme is on its globular carboxy- 
terminal domain near a HELAH consensus sequence. 

In vivo, APN is found on enterocyte brush border membranes where it 
removes the N-terminal amino acids from oligopeptides generated in the gut 
lumen by endopeptidases. APN is expressed on myeloid tumor cells and may 
play a role in metastasis of melanoma, and APN at synaptic junctions helps to 
degrade neuroactive peptides (Ashmun and Look, 1990; Hersh, 1985). Because 
the S glycoproteins of some coronaviruses require protease cleavage to activate 
cell fusion and/or viral infectivity, it was important to determine whether the 
protease activity of APN plays a role in its receptor function. Several lines of 
evidence suggest that the S glycoprotein of HCV-229E may bind at or near the 
catalytic site of hAPN. MAb-RBS, which blocks virus binding to hAPN, also 
inhibits enzyme activity (Yeager et al., 1992]. A cDNA that encodes hAPN 
containing a 39 amino acid deletion that includes the HELAH sequence en- 
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coded a protein that was not recognized by MAb-RBS and could not serve as a 
receptor for HCV-229E when expressed in mouse fibroblasts (Ashmun et al., 
1992; Yeager et al., 1992). Finally, chelation of zinc inactivated both the enzy- 
matic activity of hAPN and its HCV-229E-receptor activity (Yeager et al., 1992). 
Other evidence strongly suggests that the protease activity of hAPN or pAPN is 
not required for its coronavirus receptor function. Small inhibitors of the pro- 
tease, actinonin and bestatin, did not block HCV-229E or TGEV infection 
(Delmas et al., 1992; Yeager et al., 1992), and point mutations that abolish 
enzyme activity did not reduce virus receptor activity (Yeager et al., 1992; 
Ashmun and Holmes, in preparation). Recently, it has been shown that deter- 
minants that are essential for the TGEV-receptor interaction reside within a 
domain of pAPN that is distinct from the enzymatic site (Delmas et al., 1994). 
In summary, the serologically related coronaviruses TGEV and HCV-229E, 
which cause disease in porcine intestine and human respiratory tract, respec- 
tively, both utilize APN glycoproteins of their normal host species as receptors. 
The APN glycoproteins appear to be used as receptors because of their expres- 
sion on respiratory and intestinal epithelial cell membranes, and their enzy- 
matic activity does not appear to play a role in coronavirus infection. 


VI. CARBOHYDRATE RECEPTORS FOR CORONAVIRUSES 


The observation that the HE glycoproteins of several coronaviruses in the 
MHV/BCV/OC43 serogroup bind to Neu5,9Ac, residues on erythrocytes like 
the HE glycoprotein of influenza C, suggested that this carbohydrate moiety 
might serve as a receptor determinant for these coronaviruses (Schultze et al., 
1990, 1991b; Vlasak et al., 1988a,b). The acetyl esterase activities of the HE 
glycoproteins of BCV, HCV-OC43, and HEV, like that of influenza C virus, 
cleaves acetyl groups from Neu5,9Ac,, rendering the erythrocytes resistant to 
agglutination by these viruses. Because 9-O-acetylated neuraminic acid moi- 
eties are found on a variety of cell types in many species (Varki, 1992, 1993), any 
coronaviruses that use this carbohydrate as their only receptor would be ex- 
pected to have broader host range and tissue tropism than those of corona- 
viruses that use species-specific determinants of a particular glycoprotein such 
as murine CEA-related glycoproteins or APN. Several observations on BCV 
support this hypothesis. BCV, which expresses high levels of HE and esterase 
activity (Schultze and Herrler, 1992; Vlasak et al., 1988a,b], infects cultured 
cells and tissues of more species than most other coronaviruses (Tables I and II). 
Pretreatment of several BCV-susceptible cell lines with neuraminidase or 
acetylesterase rendered the cells resistant to BCV infection, and resialylation 
with Neu5,9Ac, but not with nonacetylated sialic acids, restored susceptibility 
to BCV infection (Schultze and Herrler, 1992). Treatment of BCV virions with 
diisopropyl fluorophosphate, which inhibits the esterase activity of the HE 
glycoprotein, drastically reduces BCV infectivity (Vlasak et al., 1988b). 

Although coronavirus HE may facilitate virus binding to cells, interaction 
of the viral S glycoprotein with a specific glycoprotein receptor may be required 
for virus penetration. This hypothesis has not yet been tested directly because 
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viruses that express HE but not S glycoprotein are not available. However, MHV 
variants that express S but not HE are infectious in vitro and in vivo, indicating 
that HE is not essential for MHV infectivity (La Monica et al., 1991; Yokomori et 
al., 1991). Therole of HE expression in determining coronavirus virulence is not 
yet understood. Because the purified S glycoproteins of BCV and HCV-OC43, in 
addition to the HE glycoproteins, bind Neu5,9Ac, (Kunkel and Herrler, 1993; 
Schultze et al., 1991a), the experiments on modulation of virus susceptibility by 
removal or replacement of Neu5,9Ac, must be interpreted with caution. It is 
possible that some coronavirus virions first bind to cells by interaction of HE 
with 9-O-acetylated neuraminic acid residues on any macromolecule, and then 
their S glycoproteins bind to Neu5,9Ac, or another moiety on a specific receptor 
glycoprotein, leading to virus penetration and uncoating. 

The expression of HE glycoprotein could affect the entry of coronaviruses 
into cells in several ways. HE might mediate initial attachment to cells and 
stabilize virus binding until S interacts with protein receptors to initiate mem- 
brane fusion. If the esterase activity of HE can cleave Neu5,9Ac, on mucins or 
on cells such as erythrocytes that lack the capability to internalize virus, then 
expression of HE would increase the probability that a virion would reach 
receptors on susceptible cells. HE in the Golgi apparatus of infected cells might 
cleave acetyl residues from Neu5,9Ac, on nascent cell membrane molecules, 
resulting in cells deficient in Neu5,9Ac,. This might facilitate release of virions 
from the infected cells. 

The coronaviruses, like picornaviruses and retroviruses, have evolved to 
utilize a variety of different receptor determinants for entry into susceptible 
cells. The characterization of receptor-binding sites on viral envelope glycopro- 
teins and the selection and characterization of mutant viruses that utilize 
different receptors may elucidate the molecular mechanisms that determine 
coronavirus tissue tropism and species specificity. 
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CHAPTER 5 


The Coronavirus Surface 
Glycoprotein 


DAVID CAVANAGH 


I. PHYSICOCHEMICAL PROPERTIES 
A. Electron Microscope Observations 


Coronaviruses are frequently claimed to have a characteristic morphology, 
including the possession of a "club-shaped" surface projection or spike (S) 
glycoprotein. However, in common with other aspects of the coronaviruses, the 
group exhibits variation with respect to the shape, size, and distribution of the 
S protein on the virion surface. Davies and Macnaughton (1979) described the 
spikes of infectious bronchitis virus (IBV) and human coronavirus (HCV) 229E 
as being “tear-drop” shaped and widely spaced, whereas those of murine hepa- 
titis virus (MHV) type 3 were mostly “cone-shaped” and closely spaced, al- 
though in some MHV-3 preparations the spikes were more bulbous. Dimen- 
sions of S vary not only among the coronaviruses but also depending on the 
staining procedure; following potassium phosphotungstate staining all three 
viruses had spikes approximately 20 nm long and 10 nm wide at the bulbous 
end, except for the cone-shaped spikes of MHV, which had a diameter of only 
5 nm (Davies and Macnaughton, 1979}. The entire S protein has been observed 
after solubilization and purification (Sturman et al., 1980; Cavanagh, 1983c}. 
The nonenvelope-associated S1 subunit of the IBV S protein can become de- 
tached from the virion (Stern and Sefton, 1982a; Cavanagh and Davis, 1986). 
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B. Sedimentation Characteristics 


Purification of the S protein of MHV, IBV, HCV strain 229E, bovine corona- 
virus (BCV), and porcine hemagglutinating encephalomyelitis virus (HEV) has 
been achieved using a combination of nonionic detergent and sucrose gradient 
centrifugation (Sturman et al., 1980; Hasony and Macnaughton, 1981; Cav- 
anagh, 1983b; Schultze et al., 1990, 1991). When milligram quantities of IBV 
were used, it was necessary to dissociate and sediment the virus proteins in the 
presence of 1 M NaCl; otherwise the M protein cosedimented with S (Cavanagh, 
1983b). Spike has also been purified by affinity chromatography (Mockett, 1985; 
Daniel and Talbot, 1990). Sedimentation studies have been variously inter- 
preted as indicating that S from virions is a homodimer or homotrimer (IBV: 
Cavanagh, 1983c), homodimer (MHV: Vennema et al., 1990b), or homotrimer 
(TGEV: Delmas and Laude, 1990]. 


C. Electrophoretic Analysis 


It cannot be said that the S protein, of the coronaviruses as a group, exhibits 
characteristic migration in sodium dodecyl sulfate-polyacrylamide gel electro- 
phoresis (SDS-PAGE). The apparent molecular weight of uncleaved S is in the 
range 170—220,000 Da, depending on the virus. The extent of cleavage into 
amino-(N)terminal S1 and carboxy-(C}terminal S2 glycopolypeptides depends 
on the virus type, virus strain, and cell type used, and ranges from 0 to 100%. 
The two cleavage products also vary in molecular weight, from 84 to 135,000, 
and may be separated or comigrate in polyacrylamide gels. Examples of electro- 
phoretic analysis include BCV (Hogue et al., 1984; Deregt et al., 1987; Cyr-Coats 
et al., 1988; Vautherot et al., 1992a), MHV-]HM (Siddell, 1982], MHV-A59 (Frana 
et al., 1985), HCV-OC43 (Hogue et al., 1984), HEV (Callebaut and Pensaert, 
1980), IBV (Cavanagh et al., 1986a-c), turkey coronavirus (TCV) (Dea et al., 
1989a), canine coronavirus (CCV), and porcine transmissible gastroenteritis 
virus (TGEV) (Garwes and Reynolds, 1981), and porcine epidemic diarrhea virus 
(PEDV) (Egberink et al., 1988). Additional references are to be found in the 
review by Siddell et al. (1983). S2 of IBV may aggregate (Stern and Sefton, 1982a] 
and stain poorly with Coomassie Brilliant Blue (Cavanagh, 1983b), while others 
have reported high-molecular-weight forms of S that are irregularly obtained 
(e.g., Hogue et al., 1984). Care must be exercised, therefore, in the interpretation 
of polyacrylamide gel polypeptide profiles of coronaviruses that have not been 
extensively studied. 


Il. PRIMARY STRUCTURE 
A. Overall Features 


The S protein is a large glycoprotein that possesses an overall hydrophobic 
hydropathicity profile, a N-terminal signal sequence that is absent from the 


CORONAVIRUS SURFACE GLYCOPROTEIN 75 


mature protein, a C-terminal hydrophilic sequence preceded by a membrane- 
spanning domain, and a large number (up to 35) of potential N-linked glycosyla- 
tion sites, most of which appear to be occupied by glycans (Fig. 1). The entire S 
protein gene has been sequenced for many strains of IBV (Binns et al., 1985, 
1986; Niesters et al., 1986; Sutou et al., 1988; Jordi et al., 1989; Kusters et al., 
19892, 1990; Koch and Kant, 1990b; Cavanagh et al 1992b; Jia et al., 1993a,b], 
TGEV [Jacobs et al., 1987; Rasschaert and Laude, 1987; Britton and Page, 1990; 
Wesley, 1990], the TGEV variant porcine respiratory coronavirus (PRCV) (Ras- 
schaert et al., 1990; Britton et al., 1991; Wesley et al., 1991], the feline corona- 
viruses feline infectious peritonitis virus (FIPV) (de Groot et al., 1987a) and 
feline enteric coronavirus (FECV) (Wesseling et al 1994), HCV-229E (Raabe et 
al., 1990], CCV (Horsburgh et al., 1992; Wesseling et al., 1994}, PEDV (Duarte 
and Laude, 1994], several strains of MHV (Luytjes et al., 1987; Schmidt et al., 
1987; Parker et al., 1989; Gallagher et al., 1990a,b; La Monica et al., 1991; 
Taguchi et al., 1992), BCV (Abraham et al., 1990; Boireau et al., 1990; Parker et 
al., 1990; Zhang et al., 1991), and HCV-OC43 (Künkel and Herrler, 1993; Mounir 
and Talbot, 1993). 

There is great variation among the coronaviruses with respect to the size of 
the S polypeptide. Currently the shortest S protein known is that of IBV, with 
approximately 1160 amino acids, while FIPV has the longest protein of 1452 
amino acids (Fig. 1). The S protein of some, but not all, of the coronaviruses is 
cleaved to produce the N-terminal S1 and C-terminal S2 glycopolypeptides. 


B. Specific Features 


1. Glycosylation, Transport, and Maturation 


Another common feature of the spike glycoprotein of coronaviruses is the 
high degree of glycosylation, the number of potential glycosylation sites rang- 
ing from 21 (MHV) to 35 (FIPV) (Fig. 1) (see Section ILA for references). Both S1 
and S2 are glycosylated, e.g., IBV-Beaudette has 16 and 12 potential glycosyla- 
tion sites in S1 and S2, respectively. Comparison of the molecular weights after 
removal of the glycans and by comparison of the deduced molecular weight of 
the nascent polypeptide, obtained by sequencing, with that obtained from SDS- 
PAGE, indicates that most of these sites are occupied by glycans. 

That the glycans, cotranslationally added, are of the N-linked (via aspar- 
agine residues) type has been shown by their susceptibility to endoglycosidase- 
H (Stern and Sefton, 1982b; Cavanagh, 1983a) and tunicamycin (Holmes et al., 
1981; Rottier et al., 1981; Stern and Sefton, 1982b; van Berlo et al., 1987; Dea et 
al., 1989a,b). Indeed, following replication in the presence of tunicamycin, 
which prevents the attachment of N-linked glycans, virus particles lacking S 
protein are produced. Conversion of the initial high mannose or simple glycans 
to complex or hybrid molecules is a slow process, the half-life being one or 
several hours (Vennema et al., 1990b,c). The MHV S protein is formed initially 
as a glycosylated monomer of molecular weight 150,000, which then oligomer- 
izes slowly. The 150,000 glycopolypeptide is converted to a 180,000 form, 
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presumably involving modification of the simple glycans, after which cleavage 
into S1 and S2 occurs (Vennema et al., 1990b). In FIPV-infected cells, the oligo- 
merization process was observed to be slower (half-time of 40-60 min) than 
that of the G glycoprotein and hemagglutinin of vesicular stomatitis virus 
(VSV) and influenza virus, respectively, but faster than that of Rous sarcoma 
virus envelope glycoprotein (Vennema et al., 1990b). The initial high mannose 
(simple) glycans become converted to complex ones, the half-time being about 1 
hr in FIPV-infected cells (Vennema et al., 1990b,c). When S was expressed using 
vaccinia virus recombinants, the half-time of this process was approximately 3 
hr. Coronavirus budding occurs at pre-Golgi membranes (see Opstelten et al., 
1993, for references}, the conversion of simple glycans to complex ones occur- 
ring after the budding. Thus the S protein would appear to move faster through 
the Golgi apparatus when it is in virions than when it is not (Vennema et al., 
1990c}. 

Similar observations had been made following infection of cells with 
TGEV (Delmas and Laude, 1990). Oligomers were formed before the simple 
glycans were converted to complex ones, the oligomerization being considered 
to have taken place not beyond the cis Golgi compartment. Many S monomers 
failed to oligomerize. When tunicamycin prevented glycosylation, oligomeriza- 
tion was prevented, indicating that the lack of glycans caused aberrant folding 
of the S polypeptide. 

Although S1 and S2 are not linked by disulfide bonds, there are many 
intrachain disulfide bonds. The ectodomain of the S protein of MHV, for example, 
containes 42 cysteine residues. Opstelten et al. (1993) have studied the role of 
these disulfide bonds by incubating MHV-infected cells in the presence of 
dithiothreitol (DTT) to produce reducing conditions. The major conforma- 
tional events involving disulfide bonds took about 20 min. The process would 
appear to start on the nascent polypeptide. The DTT did not interfere with the 
functioning of the endoplasmic reticulum. Indeed, the membrane protein (MI 
was transported normally. Opstelten and colleagues (1993) state in their discus- 
sion that M and S form complexes almost immediately after synthesis, which 
accumulate at the site of budding (pre-Golgi membranes). Thus it may be the 
interaction of S with M that prevents M progressing beyond the virus-budding 
site. 

A small proportion of S does progress to the cell surface following corona- 
virus infection (Laviada et al., 1990) and expression of S involving vaccinia virus 
recombinants (Vennema et al., 1990c; Pulford and Britton, 1991; Oleszak et al., 
1992; Kubo and Taguchi, 1993) and baculovirus (Yoo et al., 1990; Godet et al., 
1991). The S1 subunit, expressed in the absence of S2 (which forms the 
membrane-anchoring stalk of S) was excreted. A truncated. TGEV S protein, 
which lacked the carboxy-terminal 292 amino acids, was retained in the endo- 
plasmic reticulum and did not progress to the cell surface (Pulford and Britton, 
1991). However, a mutant TGEV spike gene that lacked 70 amino acids from the 
carboxy-terminus was excreted from cells after expression from a baculovirus 
recombinant (Godet et al., 1991). A mutant MHV SI that contained the S2 
transmembrane domain was transported to and retained at the cell surface, as 
was a mutant S2 to which had been added the S1 N-terminal signal peptide. 
When “normal” S1 and also S2 were transiently expressed in the same cells, S1 
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was retained at the plasma membrane, suggesting that S1 and S2 had associated 
in a manner similar to that when the intact S gene was expressed (Kubo and 
Taguchi, 1993). 


2. Cleavage of the Propolypeptide 


Although the S protein of some coronaviruses is cleavable by cell proteases, 
to yield S1 and S2, the extent to which this occurs varies, depending on the type 
and strain of virus and the host cell used. Cleaved spike has never been observed 
with two coronaviruses, TGEV and FIPV (Garwes and Pocock, 1975; Egberink et 
al., 1988]. In the case of IBV strains from chicken embryos and chick kidney 
cells (Cavanagh and Davis, 1987; Stern and Sefton, 1982a), PEDV from infected 
pigs (Egberink et al., 1988), and TCV from human rectal carcinoma (HRT) cells 
(Dea et al., 19892], all or almost all of the virion-associated spike was in the form 
of S1 and S2. Virions of IBV-Beaudette from Vero cells (Cavanagh et al., 1986b| 
and BCV-Mebus from HRT cells (Hogue et al., 1984; Cyr-Coats et al 1988) had 
about 7096 and 5096 cleaved S, respectively. In contrast, almost none of the S of 
HCV-229E from human rhabdomyosarcoma cells (Schmidt and Kenny, 1982), 
MRC5 cells (Kemp et al., 1984), and human embryonic lung cells (Arpin and 
Talbot, 1990) and of HCV-OC43 (Hogue and Brian, 1986) was cleaved, although 
the latter studies have shown that S of HCV-OC43 can be completely cleaved by 
trypsin. The role played by the cell type is well illustrated with MHV-A59, 
where the extent of S cleavage varies from 0-10096, depending on the host cell 
used (Frana et al., 1985). 

Notwithstanding the requirement that cells have appropriate proteases with 
which to cleave S, there are features of some spike molecules that make them 
potentially more readily cleavable than the S of other coronaviruses. Thus 
sequencing of the S gene of many IBV strains has shown that the cleavage site is 
adjacent to a basic amino acid sequence, RRFRR, RRSRR, RRHRR, or GRHRR 
(Binns et al., 1985, 1986; Cavanagh et al., 1986b, 1988; Niesters et al., 1986; 
Kusters et al., 1989a). MHV strains JHM and A59 have basic connecting peptides 
comprising RRARR and RRAHR, respectively (Schmidt et al., 1987; Luytjes et 
al., 1987), and BCV has the sequence KRRSRR or KRRSVR at the corresponding 
location (Abraham et al., 1990; Boireau et al., 1990; Parker et al., 1990, Zhang et 
al., 1991). These basic sequences, which would be situated at the surface of the 
protein, would be expected to make the spike sensitive to those cell proteases 
that hydrolyze adjacent to basic residues. In the case of influenza A viruses 
those strains that have the hemagglutinin (HA) protein cleaved in many cell 
types have an HAL HAT) connecting peptide containing several basic residues, 
whereas those isolates in which cleavage occurs in fewer cell types have only a 
single basic residue at the cleavage site (Bosch et al., 1981). In keeping with this 
observation, the S of TGEV and FIPV, which has not been observed in a cleaved 
form, and that of HCV-229E, which is hardly cleaved, does not have any pairs of 
basic residues in the region analogous to the cleavage site of IBV and MHV 
(Rasschaert and Laude, 1987; Jacobs et al., 1987; de Groot et al., 1987a; Raabe et 
al., 1990). The capacity of the S protein of HCV-OC43 to be cleaved by trypsin is 
in keeping with the presence of the sequence RRSR, which is a potential 
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cleavage site (Künkel and Herrler, 1993; Mounir and Talbot, 1993). The require- 
ment for cleavage of S for the fusion process is discussed in Section V.3. 


3. Structure of S2 


The observation that urea removed S1 but no other polypeptide from IBV 
led to the suggestion that S was anchored in the envelope by S2 (Cavanagh, 
1983c]. Support for this view was provided by comparison of N-terminal and 
nucleotide sequencing data that showed that S2 was generated by cleavage of 
the S precursor and that near the carboxy-terminus of S2 there was a 44 residue, 
strongly hydrophobic sequence suggestive of a membrane-spanning domain 
(Binns et al., 1985; Cavanagh et al., 1986b). Analysis of the deduced amino acid 
sequences of the spike protein of IBV, MHV, and FIPV (de Groot et al., 1987b], 
supported by Rasshaert and Laude's (1987) deductions based on TGEV data, has 
revealed the presence of two regions in S2 with a seven-residue periodicity, 
forming heptad repeats (Fig. 1). These are indicative of a coiled-coil structure. 
Sedimentation studies have indicated that S is either a homodimer or homo- 
trimer. The results of a cross-linking study have favored the latter (Delmas and 
Laude, 1990). Currently, therefore, the oligomeric spike protein is envisaged as 
being anchored by an a-helical region near to the C-terminus of S2. Just beyond 
the outer membrane surface is the shorter (minor) repeat structure predicted to 
be an a-helix of 5 (IBV and MHV] or 7 nm (FIPV). The major repeat indicates a 
helix of some 10 (IBV and MHV) or 13 nm (FIPV) or more, which would span 
more than half of S (Fig. 1) (de Groot et al., 1987b). Thus S is envisaged as a 
multimeric protein, possibly a homotrimer, the narrow stalk of S being a 
complex coiled-coil structure, somewhat analogous to the hemagglutinin tri- 
mer of influenza virus (Wilson et al., 1981). Just before the membrane-spanning 
domain of S is the sequence KWP. Terminating 10 residues upstream of KWP isa 
leucine-zipper motif, the length of these varying from three to five heptad 
repeats (Britton, 1991]. 

S2, of IBV-M41 was susceptible to trypsin and other proteases, near its 
C-terminus, at residues close to the outer membrane surface (Cavanagh et al., 
1986a). The S1 subunit of IBV could be quantitatively released from S2 by 
treatment with urea (Cavanagh and Davis, 1986). Although urea did not have 
this effect on MHV-A59, incubation of MHV at pH 8.0 and 37 ?C did result in 
the release of S1 (Sturman et al., 1990; Weismuller et al., 1990). 


4. Conservation of S2 among Coronaviruses 


Comparison of the amino acid sequences of the spike proteins from IBV, 
MHYV, FIPV, and TGEV has shown that there is far greater sequence conserva- 
tion within the C-terminal, S2 part of the molecule than in S1 (Schmidt et al., 
1987; de Groot et al., 1987b; Rasschaert and Laude, 1987). Within S2 are regions 
exhibiting 30% identical residues (Fig. 1), while comparison of S2, of BCV strain 
F15 with that of MHV-A59 reveals about 70% homology (Fig. 4) (Boireau et al., 
1990). One notable feature, shared by the various coronaviruses, is that whereas 
the average cysteine content of S is about 3%, that in the C-terminal hydro- 
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philic tail, which is probably within the lumen of the virus, is around 24%. 
Some of these cysteine residues may be associated with fatty acid chains, 
known to be present on S2 of MHV (Schmidt, 1982; Sturman et al., 1985; van 
Berlo, 1987). 

All the coronavirus S genes analyzed encode an highly conserved eight- 
residue sequence Lys-Trp-Pro-Trp-Trp/Tyr-Val-Trp-Leu. The leucine-zipper 
motifs described in the previous section ends 10 residues before the Lys-Trp-Pro 
of this conserved sequence, the remainder of which probably forms the first part 
of the transmembrane domain (see Britton, 1991]. 


5. Extensive Variation of S1 among Coronaviruses 


In contrast to the modest conservation of S2, within the coronavirus genus, 
S1 shows very little amino acid identity when some pairs of viruses are com- 
pared, e.g., IBV with TGEV (Rasschaert and Laude, 1987), IBV with MHV 
(Schmidt et al., 1987), and MHV with FIPV (de Groot et al., 1987b) (Fig. 1). 
Indeed, the extensive heterogeneity exhibited by S1 extends to strains within 
some coronavirus species, notably IBV. In recent years there has been extensive 
analysis of the antibody-inducing epitopes of some coronaviruses, the S1 sub- 
unit being the major inducer of virus-neutralizing (VN] antibodies in some 
coronaviruses, e.g., IBV and TGEV. Since sequence differences in S1 are respon- 
sible for many of the antigenic differences between coronaviruses it is appropri- 
ate to discuss sequence and antigenic variation in the same section (Section III). 


III. STRUCTURAL AND ANTIGENIC VARIATION OF S 
A. S Protein Is a Major Inducer of Neutralizing Antibody 


All three surface glycoproteins of coronaviruses, S, M, and, when present, 
HE, induce VN antibody (see Chapters 6 and 8, this volume, for M and HE, 
respectively). Although there are fewer S than M and N molecules in the virion 
(Cavanagh, 1981], the spike protein induces a good immune response following 
infection. That S induces VN antibody has been shown in three ways. First, by 
immunization with purified S protein of TGEV (Garwes et al., 1978/79), MHV 
(Hasony and Macnaughton, 1981), and IBV (Cavanagh et al., 1984). Second, by 
immunization with vectors expressing the S gene of IBV (Tomley et al., 1987}, 
FIPV (de Groot et al., 1989], TGEV (Godet et al., 1991], and MHV (Daniel and 
Talbot, 1990; Wesseling et al., 1993). Third, by showing that many VN mono- 
clonal antibodies (MAbs) bound to the S protein of MHV (references in subse- 
quent sections]. 


B. Location of Antigenic Sites 


A number of approaches have been used to locate antigenic sites on S, all 
involving the use of MAbs: competition enzyme-linked immunosorbent assay 
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(ELISAs); production of polypeptides or peptides by proteolysis of S, prokaryotic 
expression vector or synthetically; and sequencing of MAb-resistant variants. 
No simple picture has emerged for the genus as a whole, except that S1 has more 
VN epitopes than S2 in some species. 

Earlier, polyclonal antibody analyses (for references, see Spaan et al., 1990) 
indicated that each coronavirus could be placed into one of four antigenic 
groups. Combining more recent biochemical, sequence, and MAb analyses 
would suggest three groups. In one group would be TGEV, FIPV, CCV, HCV-229E, 
and PEDV. A second group would contain MHV, BCV, HCV-OC43, and TCV, 
while IBV would constitute the sole member of the third group (Chapter 1, this 
volume}. It is instructive to compare the S protein of coronavirus species within 
these groups. The S protein differences observed among species within a group 
may play a role in host range and tissue specificity and certainly do so with 
respect to antigenic differences of practical importance. 


1. TGEV, FIPV, CCV, HCV-229E, and PEDV 


Four or five major antigenic domains were detected on the S protein of 
TGEV using competition assays with MAbs (Delmas et al., 1986; Jimenez et al., 
1986; Correa et al., 1988; Simkins et al., 1989; Enjuanes et al., 1990). Subse- 
quently, extensive analysis of the binding of MAbs to proteolytic fragments of S 
(Correa et al., 1990; Delmas et al., 1990a,b; Enjuanes et al., 1990), to S polypep- 
tides generated in Escherichia coli [Enjuanes et al., 1990; Delmas et al., 1990a,b], 
to synthetic peptides (Posthumus et al., 1990a,b; Gebauer et al., 1991), and 
sequencing of MAb-resistant (MAR) mutants (Delmas et al., 1990a,b; Enjuanes 
et al., 1990; Gebauer et al., 1991) has revealed that the antigenic makeup of S is 
complex. In addition, some sites would appear to be composed of sequences that 
are linearly far apart, but which are brought into close proximity by the folding 
of the protein, with one sequence playing a more prominent role than the other. 

The major research teams in this area have used different systems for 
naming antigenic domains (see Gebauer et al., 1991, for a comparison) and a 
common nomenclature, using Roman numerals, has been proposed (Post- 
humus et al., 1990a,b). I have used this nomenclature in Fig. 2 to show the 
approximate positions of the antigenic domains, taking into account data gen- 
erated by all the approaches referenced above. 

The SI part of S is the major inducer of antibodies and many of the anti-S1 
MAbs were VN. The major VN epitopes are in region I and are highly conserved 
among TGEV isolates. This, and the great difficulty experienced in obtaining 
MAR mutants to site I MAbs, has led to the conclusion that this region has an 
important function. Competition between MAbs and antibodies in sera raised 
following vaccination of pigs with TGEV confirmed the immunodominant 
nature of site A, in particular subsite antibody (Diego et al 1992). Sites II and IV 
are also associated with VN epitopes. Site III is exposed on denatured but not 
native S. Differentiation among strains of TGEV has been achieved by MAbs 
directed against epitopes at sites II, III, and IV (Welch and Saif, 1988; Sanchez et 
al., 1990). Some epitopes may be formed by residues contributed from sites I and 
II (Correa et al., 1990; Delmas et al., 1990a,b], while residues in S2 (1176-1184) 
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contribute in some way to the antigenicity of epitopes at site IV (Posthumus et 
al., 1990a,b, 1991). 

There were differences among the sites with respect to the influence of 
glycans. The binding of MAbs to site IV was little affected by the absence of 
glycans on S, whereas nonglycosylated/deglycosylated S did not bind site I and 
II MAbs, and site III was perhaps an intermediate case (Gebauer et al., 1991; 
Delmas and Laude, 1991). 

A very interesting variant of TGEV of economic importance was isolated in 
Europe in the mid-1980s (see Chapter 16, this volume). This variant was named 
porcine respiratory coronavirus because it replicated mainly in the respiratory 
tract of pigs and very little in the alimentary tract. Hence, it did not cause the 
enteric disease so characteristic of classical TGEV. Sequencing revealed muta- 
tions in gene 3 of PRCV, leading to loss of the first open reading frame, plus a 
large deletion of 672-681 nucleotides from near the beginning of the S gene 
(Rasschaert et al., 1990; Britton et al., 1991; Wesley et al 1991) (Fig. 3). Thus 
PRCV lacks antigenic sites II and III, and MAbs to these sites can be used to 
differentiate PRCV from TGEV (Garwes et al 1988; Callebaut et al., 1988; 
Enjuanes et al 1990; Simkins et al 1992). The remainder of S of PRCV and 
TGEV have 98% identity and, correspondingly, have site I and IV epitopes in 
common (Sanchez et al., 1990). 

The S proteins of TGEV and FIPV are almost identical in length, and for 
residues 275-1447 there is 94?6 identity and some common site I and IV epi- 
topes, including some that are VN and that mediate antibody-dependent 
enhancement of FIPV infectivity (see Chapter 14, this volume). In contrast, the 
first 274 residues have only 30% identity, and no common site II and III epitopes 
have been detected (Fig. 3) (Jacobs et al., 1987; Sanchez et al., 1990). 


PROV 77 — T 


225 98% 

rev [|= [a i ii ili iii] “ 
30% 274 94% 
81% 300 95% 

cv [De ei a w 
O = 60% 


ew DH Een = 


FIGURE 3. Amino acid sequence differences among the S proteins of members of the TGEV group 
of coronaviruses. The diagram shows the much greater variation among the amino-terminal 225— 
300 residues than among the remainder. The gaps between rectangles representing S of PEDV and 
HCV-229E show the approximate size and position of the largest deletions only. HCV-229E and the 
PRCV variant of TGEV lack the first 225 or so residues of the other species. The amino-terminal 
signal peptide is indicated by the lighter shading. 
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Feline coronaviruses could be divided into two antigenic groups on the 
basis of binding of anti-S MAbs (Fiscus and Teramoto, 1987a,b; Hohdatsu et al., 
1991). A feline coronavirus that did not cause FIP (FECV) had 90% and 97% 
amino acid identity within residues 1-300 and 301-1452, respectively, of a FIPV 
isolate (Wesseling et al., 1994). 

The CCV S protein is the same length as that of FIPV, and residues 300-1452 
have 95% identity (Horsburgh et al., 1992; Wesseling et al., 1993) and some site I 
epitopes in common, including VN epitopes (Fig. 3) (Sanchez et al., 1990; 
Hohdatsu et al., 1991). When the first 300 residues are compared, the identity 
with FIPV is still high, at 81% (Wesseling et al., 1993]; many isolates of one 
species, IBV, differ from each other by this amount in the first 300 or so residues 
(see Section II A Al None of the TGEV site II and III MAbs of Sanchez et al. 
(1990) bound to CCV, in keeping with the low (38%) identity between the first 
300 residues of these two viruses. 

The sequence of several genes of another porcine coronavirus, PEDV, have 
been determined recently and have confirmed that this virus more closely 
resembles the TGEV group than the other coronavirus groups. Although some- 
what shorter than that of TGEV, the 1383 residue PEDV S does include the 
N-terminal region that corresponds to the beginning of the TGEV S protein and 
is absent from PRCV (Fig. 3) (Duarte and Laude, 1994). However, there is only 
about 10% identity in this region when compared with TGEV, CCV, and FIPV. 
Not surprisingly, none of 25 anti-TGEV spike protein MAbs bound to PEDV 
(Sanchez et al., 1990). 

Finally, sequence analysis of HCV-229E has confirmed its relationship to 
the TGEV group, although, like PRCV, the 229E S protein lacks the first 225 or 
so amino acids (Fig. 3) (Raabe et al. 1990). The extent of identity of the 229E S 
protein to that of TGEV, FIPV, and CCV is similar to that exhibited by PEDV. 
Thus, the first 550 or so amino acids have about 38% identity, the C-terminal 
half being greater (57%). In contrast, the identity with the first half of S of MHV 
and IBV is only 16-18% and for the second half 32-35% (Raabe et al., 1990). 
Once again, none of the TGEV anti-S MAbs of Sanchez et al. (1990] bound to the 
229E S protein. 


2. MHV, BCV, HCV-OC43, HEV, and TCV 


Recent sequencing and MAb studies of these viruses have confirmed their 
much greater similarity to each other than to other coronaviruses. Comparison 
has revealed a hypervariable region within S1 that is associated with some of the 
antigenic differences not only between these virus species but also among 
strains of a given species (Fig. 4). This hypervariable region may also be associ- 
ated with host range and tissue tropism, although this is speculation. Sequence 
data are not available for the S protein of HEV and TCV but they are for the other 
members of this group; it is to these that the following discussion of sequence 
applies. 

The lengths of the S proteins are very similar, ranging from 1351/53 for 
HCV-0C43 to 1376 for MHV-4 (see Section ILA for references). The BCV S protein 
is most closely related to that of HCV-OC43 (91% identity] and less so with MHV 


CORONAVIRUS SURFACE GLYCOPROTEIN 85 


Amino acid sequence differences among the S proteins of the 'MHV' group 
S1 E» T Exe $2 


Ae 


500 U 


99% 





1376 


MHV-4 


1363 


BCV 


HCV-OC43 15 


FIGURE 4. Amino acid sequence differences among the S proteins of the MHV group of corona- 
viruses. The rectangles represent the whole spike protein, the numbering having been normalized 
withrespect to that of the longest known MHV spike protein, belonging to strain MHV4. The arrow 
at position 770 marks the cleavage site between S1 and S2. The numbers in the gaps show the size of 
deletions compared with MHV4. The percentage differences shown for the pairs MHV-A59/MHV- 
JHM and MHV-JHM/MHV-4 refer to comparisons of S1 and S2, excluding the deletions from S1. The 
percentage differences shown for the pairs MHV4/BCV and BCV/HCV-OC43 are for three areas, the 
middle one being the hypervariable region. 


(about 60%). HCV-OC43 also has approximately 60% identity to the S protein 
of MHV. As observed with the other coronviruses, identity is greater in S2 
(BCV:HCV, 95%; BCV:MHV, 7196) than in S1 (BCV:HCV, 8896; BCV:MHV, 55%). 

The very high identity shown by BCV and HCV-OC43 is interrupted by a 
135-residue sequence in S1 (residues 465-609 of HCV} in which the identity 
drops from >90% to 75% (Fig. 4). The similarly situated residues of BCV 
(residues 452-593) have <40% identity with the corresponding region of 
MHV-4 compared with >55% elsewhere in S (Fig. 4). Indeed, about half of the 
amino acid differences between S of MHV-4 and BCV occur within this region 
which accounts for only about 1296 of S. It is in this same region that strains of 
MHV differ dramatically from each other. Whereas most of the SI of MHV 
strains has about >90% identity, at this location there are deletions of a single 
stretch of amino acids as great as 159 residues (Taguchi et al., 1985; Luytjes et 
al., 1987; Morris et al., 1989; Parker et al., 1989; Taguchi and Fleming, 1989; 
Banner et al., 1990; Gallagher et al., 1990a,b; La Monica et al., 1991). It is likely 
that MHV-4 is a prototypic MHV strain, the commonly used A59 and JHM 
strains having been selected during passage in vitro. 

The S gene of six BCV strains, isolated in several countries in two conti- 
nents, have been sequenced, revealing a very high conservation (9896 overall) 
(Zhang et al., 1991). Most of the differences (45-56 bases, 16 to 26 amino acids) 
were between virulent and avirulent strains, compared with only 6-14 base 
differences («10 amino acids) among the four avirulent isolates. There were 
twofold more differences in S1 than S2, but the differences in S1 were not 
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concentrated in one region. A BCV isolate from nasal swabs of a calf suffering 
from respiratory disorders had a S protein that had 98% amino acid identity 
with that of enteric BCV isolates (Zhang et al., 1994). 

The first coronavirus to be extensively studied with respect to the location 
of its antigenic sites was MHV. A complex picture has emerged and might be 
summarized as follows. Following infection with MN, epitopes formed by the 
S] subunit induce antibodies with the highest VN activity. Most of these 
epitopes are conformational. In these respects MHV resembles BCV, in addition 
to viruses of the TGEV group (Section IILB.1) and IBV (Section III.B.3). Some 
epitopes associated with the S2 subunit of MHV induce VN antibodies, but 
these have usually been of low titer. Several of these epitopes are continuous 
and have been mimicked by peptides. There is evidence from competition 
experiments with MAbs that some antigenic sites are formed or at least influ- 
enced by residues contributed by both S1 and S2. 

Competition with MAbs revealed up to six epitope groups (Talbot et al., 
1984; Wege et al., 1984; Talbot and Buchmeier, 1985) of which two [sites A and B] 
or three (sites A, B, and C] were associated with VN antibodies. Cell fusion was 
inhibited by site A MAbs. Direct evidence that the major VN epitopes reside on 
S1 has been derived from analysis of the binding of MAbs to S1 in the absence of 
S2. S1 expressed using a baculovirus was able to bind anti-S1 VN MAbs (Taguchi 
et al., 1990; Takase-Yoden et al., 1990). Using the c1-2 variant of MHV-JHM, the 
S1 and S2 glycopolypeptides of which can be distinguished in SDS-PAGE gels, 
Kubo et al. (1993) showed that of 15 MAbs raised against S, 14 immunoprecipi- 
tated S1 after this had been dissociated from S2 by urea and 2-mercaptoethanol. 
Nine of the 14 MAbs were VN, only one bound to S1 after boiling in the presence 
of SDS and DTT, and five of the VN MAbs failed to immunoprecipitate S1 after 
only mild denaturation (SDS at room temperature without DTT). Taken to- 
gether, the MAb-binding studies of Takase-Yoden et al. (1990) and Kubo et al. 
(1993) show that some S1 residues form epitopes independently of S2 amino 
acids and that some of these epitopes are discontinuous in nature. 

Neutralizing MAbs 11F and 30B were induced by MHV-JHM sequences in 
bacterial fusion proteins and mapped to S1 residues 33—40 and 395—406, respec- 
tively (Fig. 5c] (Routledge et al., 1991). Competition experiments (Stuhler et al., 
1991) using purified MHV-JHM revealed that MAb 30B competed with highly 
neutralizing, discontinuous epitope MAbs of the site A described by Wege et al. 
(1984). This indicates that residues around amino acid position 400 of the MHV 
Sl protein form at least part of site A (Wege et al., 1984) and play a major role in 
the induction of VN antibodies. This location is adjacent to a highly variable 
region of S1 (Fig. 5). 

MAbs to sites B and C selected MHV-JHM variants that had large deletions 
in S1 (Talbot et al., 1984; Dalziel et al., 1986; Parker et al., 1989; Gallagher and 
Buchmeier, 1990; Gallagher et al., 1990; La Monica et al., 1991). This per se does 
not prove that the epitopes for these MAbs were located in this part of S1; the 
epitopes might have been situated elsewhere in S1 or in S2 and their configura- 
tion disrupted by the large deletion in S1. In this regard it is noteworthy that 
neither of the MAbs of Talbot et al. (1984) used to select the variants with the 
deletions in S1 could react with denatured S in Western blots, indicating that 
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FIGURE 5. Antigenic and other sites within the S protein of strains of MHV. The amino acid 
numbering has been normalized with respect to that of the longest known MHV protein, that of 
MHV4. The numbers in the gaps show the size of deletions copared with MHV4. The arrow at 
position 770 marks the cleavage site between S1 and S2. (a) Shows the location of antigenic domains 
I-V identified by Daniel et al. (1993) using bacterially expressed fragments of S of MHV-A59. [he 
small, filled rectangle represents the peptide of Luytjes et al. (1989) (residues 900-908), which bound 
the VN and anti-fusion Mab 19.2. The small, open rectangle represents the peptide of Talbot et al. 
(1988) (residues 1,135-1,144), which induced VN antibodies and protective immunity in mice. (b) S 
polypeptide of neurotropic variants of MHV-JHM isolated from the brain and spinal cord of JHM- 
infected rats (La Monica et al., 1991). (c) S of MHV-JHM showing three sites identified by the binding 
of bacterial expression products with VN MAbs 11F and 30B (MAb 11F also inhibits fusion) and with 
VN and antifusion MAb 10G (Routlege et al., 1991). The arrow indicates the positions (1250, 1251, 
and 1257] of amino acid substitutions in MHV-JHM MAR mutants selected by MAb LIF (Grosse and 
Siddell, 1994). (d) S of a MAR mutant of MHV4 selected with MAb V4B11.3 (Parker et al. 1989). (e) S 
of MHV4 showing the heptad repeat regions (stippled) and membrane-spanning region (solid box). 
The locations are shown of three mutations present in mutants of MHVA recovered from a persis- 
tently infected neural cell line, the mutants requiring a pH of 5.5-6.0 for membrane fusion 
(Gallagher et al., 1991). The arrows show the position of two point mutations present in a MAR 
mutant selected with MAb 7.2-V-1 (Wang et al 1993). 


the epitopes were of a discontinuous nature. Not all MAR mutants with muta- 
tions in S1 have deletions. For example, MAb 7.2-V-1 selected a MAR mutant of 
MHV-DL that had two point mutations, both in S1, at residues 588 and 650, the ` 
former within and the other very near the hypervariable region of MHV (Fig. 5e] 
(Wang et al., 1992). The hypervariable region is even larger than that indicated 
by comparison of MAR mutants, as La Monica et al. (1991) have shown with 
neurotopic variants of JHM isolated from the brain and spinal cord of JHM- 
infected Wistar Furth rats. These variants lacked 147 amino acids in addition to 
those already absent from JHM when compared with MHV4 (Fig. 5b). 

A number of studies have shown that the situation is more complex than 
the foregoing account would imply. MAb 11F, which was induced by residues 
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33-40 in a bacterially expressed product, was used to raise 10 MAR mutants. 
Sequencing of these showed that the single amino acid mutations were not 
located in residues 33-40 but at positions 1109, 1110, and 1116 of the JHM S 
protein (equivalent to residues 1250, 1251, and 1257 of MHV4; Fig. 5c) (Grosse 
and Siddell, 1994). These mutations did not affect the binding of many other S1 
and S2 MAbs, showing that the mutations had not caused extensive alterations 
to the structure of S. This implies some specific interaction between amino 
acids in region 33-40 of S1 and those around residues 1109-1116 of S2. 

Several other studies have shown the presence of epitopes on S2. A site A 
MAb that inhibited fusion reacted with an nine amino acid sequence, delin- 
eated using prokaryotic expression products and synthetic peptides, situated 
131 residues downstream from the N-terminus of S2 (Fig. 5a) (Luytjes et al 
1989). This region is particularly interesting because the MAb used, although 
neutralizing, could not select MAR mutants (Talbot and Buchmeier, 1985). 
Moreover, this site was highly conserved among MHV strains. Taken together, 
these observations suggest that this region of S may have a vital function for the 
virus. The MAb directed against this peptide was able to protect mice from 
lethal infection (Koolen et al., 1990]. The peptide has also been expressed as a 
chimeric protein and induced protection in mice (Brown et al., 1994). 

Further involvement of S2, in the induction of VN antibodies has been 
shown by the finding that a decapeptide, corresponding to the sequence starting 
at residue 365 from the N-terminus of S2 of MHV-JHM, not only induced VN 
antibodies but also induced in mice some protection from a lethal challenge of 
MHV (Fig. 5a) (Talbot et al., 1988). Induction of a VN, and antimembrane fusion, 
MAb by MHV-JHM sequences in a bacterial fusion protein showed that S2 
residues 1123-1137 of MHV-JHM (equivalent to residues 1264-1276 in the 
MHV4 strain) (Fig. 5c) induced antibodies with these properties (Routledge et 
al., 1991). A VN MAb (2.2-V-1) selected a MAR mutant with a single amino acid 
substitution at residue 1113 in S2 of MHV-DL, a strain that has >99% identity 
with MHV-JHM (Fig. 5) (Wang et al., 1992). 

Recently Daniel et al. (1993) have investigated the antigenicity of bacterial 
fusion proteins containing overlapping fragments of the S protein of MHV-A59 
(Luytjes et al., 1989). This strain lacks many of the residues of the hypervariable 
region described above (Fig. 5a). The products were examined by immunoblot- 
ting, using mostly MAbs that had previously been shown to bind to denatured S 
(most likely binding to continuous epitopes) but also a few MAbs against 
discontinuous epitopes. The MAbs had been raised against native S protein. 
Five domains (I- V) of S, in both S1 and S2, bound 13 of these antibodies (Fig. 5], 
of which sites I, III, and IV bound VN antibodies. A synthetic 13-residue peptide, 
conserved among MHV strains, was located within domain III (Luytjes et al., 
1989). The combined data further support the view that sequences that are 
linearly situated far apart, e.g., domains II and V, are closely situated in the 
native protein, forming discontinuous epitopes. 

In summary, for MHV, VN epitopes are formed largely by residues in S1, but 
there are some VN epitopes in S2. In some cases there may be interaction 
between S1 and S2 residues, brought into juxtaposition by the folding of the 
protein, to form VN epitopes, but there are also epitopes in S1 and S2 that are 
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independent of each other. Some epitopes in both S1 and S2 are continuous in 
nature, while some VN epitopes in SI, at least, rely on conformation for anti- 
body binding. 

The S proteins of BCV, TCV, HEV, HCV-OC43, and MHV are antigenically 
related, as shown using both sera and MAbs (Hogue et al., 1984; Tijssen et al., 
1990; Vautherot et al., 1990, 1992a; Michaud and Dea, 1993). 

Using five MAbs in competition experiments, Deregt and Babiuk (1987) 
detected two nonoverlapping sites, both associated with VN antibody, in S of 
BCV. Following interaction of the MAbs with fragments of S generated with 
proteases, Deregt et al. (1989a) tentatively concluded that these two VN 
antibody-inducing domains mapped within the S1 subunit. This was later con- 
firmed using S1 that had been expressed independently of S2 using a baculovirus 
expression system (Yoo et al., 1990]. More detailed analysis has been performed 
by Vautherot et al. (1992a) who used 30 anti-S MAbs. Nineteen of these defined 
four independent antigenic sites on S1, one of which (SIB] was a major and the 
other a minor (S1-A) inducer of VN antibody. Site 1B could be subdivided. All the 
anti-S VN MAbs bound to S1. Some of the SI VN MAbs failed to react with SDS- 
denatured polypeptides, while others did react provided that the polypeptides 
had not also been reduced, indicating that the epitopes were conformational. 
Although MAR mutants were obtained, no sequence data are available for 
these. However, Yoo et al. (1991b) have constructed a set of deletion mutants of 
S1, expressed them in a baculovirus system, and examined the binding of the A 
and B site VN MAbs of Deregt et al. (1989a]). Only mutant S1 containing residues 
324-720 bound the MAbs. The hypervariable region of this group of viruses is 
situated in the middle of this 324-720 sequence (Fig. 4). Two immunoreactive 
domains were further delineated, namely, one between residues 324 and 403 
situated on the N-terminal side of the hypervariable region and one between 
residues 517-720, the N-terminal portion of which is part of the hypervariable 
region. Mapping of the 37-kDa trypsin fragment previously examined by Deregt 
et al. (19892) further indicated that the second domain was formed by residues 
517 to 621, most of which is in the hypervariable region, and that the first 
domain comprised residues 351—403. It was not possible to assign site A and B 
MAbs to one or other of these domains; both were required for binding of 
each MAb. 

Six MAbs, all non-VN, defined two sites on S2 (Vautherot et al., 19924]. 
Analysis of prokaryotic expression products and synthetic peptides have shown 
that MAbs to site S2A, and no other S2 or S1 MAbs, and polyclonal sera against 
BCV bound to epitopes formed by residues from within the first 20 N-terminal 
residues of S2 (Vautherot et al., 1992b). The recognition of these epitopes by 
convalescent serum showed that these epitopes induced antibodies in the host 
animal. MAbs to these epitopes bind not only to BCV strains but also to isolates 
of MHV, HEV, and HCV-OC43 (Vautherot et al., 19923), even though there are 
amino acid differences in this region. A similarly situated continuous epitope 
has been identified at the N-terminus of S2 of IBV, although this induced 
weakly neuralizing antibody (Kusters et al., 1989b; Kant et al., 1992). 

The MAbs of Vautherot et al. (19922) to the spike VN sites A and B were 
specific to BCV, whereas antibody to the other two, non-VN SI sites (C and DJ 
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bound to HCV-OC43 and one S1-D MAb bound to HEV and MHV. However, 
some BCV site B MAbs of Michaud and Dea (1993) did bind to HCV-OC43. 
These MAbs were highly VN and therefore probably correspond to the A and B 
sites of Vautherot et al. (1992a), Deregt and Babiuk (1987), and Deregt et al. 
(1989a,b), which are located within and to the N-terminal side of the hypervari- 
able region (Yoo et al., 1991b], as described above. It would be expected that 
some site A and/or B MAbs would bind to HCV-OC43 since the sequence of the 
BCV antigenic domain containing residues 351-403 is identical to that in HCV- 
OC43 (Künkel and Herrler, 1993; Mounir and Talbot, 1993). Some variation 
among BCV isolates has been detected using MAbs (El-Ghorr et al., 1989; 
Hussain et al 1991; Michaud and Dea, 1993). Some of the anti-S MAbs raised 
against the cell-adapted, attenuated Mebus strain failed to bind to virulent field 
strains, which is in keeping with the sequence data that show that the field 
isolates resemble each other more than they do the Mebus strain. In addition, 
some anti-Mebus spike MAbs differentiated some Quebec isolates from each 
other. Michaud and Dea (1993) have shown that BCV and TCV had many spike 
VN epitopes in common. 

In summary, the BCV VN epitopes identified to date reside in S1, probably 
located near and within the hypervariable region (Fig. 4). Few of these epitopes 
were common to other viruses of the group, whereas there was greater conser- 
vation with respect to the non-VN epitopes in Sl and even more so for S2 
epitopes. These findings are consistent with our knowledge of the S sequences 
of these viruses. 


3. IBV 


Since its isolation in 1936, many serotypes of IBV have been defined by VN 
tests. This antigenic variation is of great practical importance, resulting in the 
production of vaccines to several serotypes and/or the frequent modification of 
prophylaxis protocols (King and Cavanagh, 1991]. 

The major VN antibody sites of IBV reside in S1. This has been deduced as 
(1) virus from which S1 had been removed by urea did not induce VN antibody 
(Cavanagh, 1983c; Cavanagh and Davis, 1986); (2) urea-released S1, although in 
monomeric form, did induce VN, and hemagglutination-inhibiting antibody 
(Cavanagh and Davis, 1986]; and (3) strongly neutralizing MAbs bound to SI 
(Mockett et al., 1984; Koch et al., 1990; Karaca et al., 1992; Parr and Collisson, 
1993) and selected MAR variants with mutations in S1 (Fig. 6) (Cavanagh et al., 
1988; Koch et al., 1990; Kant et al., 1992). The dependence on conformation for 
the antigenicity of the S1 sites has been shown not only by the failure of anti-S1 
VN MAbs to react in Western blots but also by reduced antibody binding after 
deglycosylation of S with glycosidases (Koch and Kant, 19904]. 

One site in S2 (site G) induced weakly neutralizing antibody (Kusters et al., 
1989b). Analysis with prokaryotic expression fragments and with synthetic 
peptides has shown that this S2 site, which is not denatured by SDS and 
mercaptoethanol (Koch and Kant, 1990a), is situated within the first 20 residues 
after the N-terminus of S2 (Fig. 6) (Lenstra et al., 1989; Kusters et al., 1989b; 
Posthumus et al., 1990a,b}. Part of this sequence is strongly conserved among 
isolates with otherwise very variable S sequences. A similarly situated se- 
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24 132 291 325 361 
60 149 299 332 387 
61 302 333 394 
63 323 334 398 





FIGURE 6. Location of some antigenic sites on the S protein of IBV. Conformation-dependent sites 
A to E are associated with VN epitopes and were defined by the sequencing of MAR mutants; amino 
acid position 63 was identified on strain M41 and most of the others on strains of the D207 group. 
The numbers refer to amino acid positions relative to the D207 isolate. The order of sites A, B, and C 
is uncertain. MAbs to site D also inihibited hemagglutination. Sites F and G contain epitopes that 
are not conformation-dependent and are defined using bacterial expression products and peptides 
and are associated with weakly neutralizing antibodies. sp, Signal peptide; cp, basic S1-S2 connec- 
ting peptide. 


quence in the S protein of BCV is also very antigenic (Vautherot et al., 1992b). 
Kusters et al. (1989b) have suggested that the beginning of S2 is a protruding 
protein segment with a high local mobility. 

Competition assays with 27 anti-S MAbs resulted in the identification of 
six antigenic sites (A—F) in S1 of the D207 strain and two (G, HJ) in S2 (Fig. 6) 
(Koch and Kant, 1990a; Koch et al., 1990). One site, E was defined using bacterial 
expression proteins and one weakly neutralizing MAb (Fig. 6) (Kusters et al., 
1989b). The other five S1 sites bound VN antibody, two (D, E) and three (A, B, 
and C] sites being located in the first (region I) and third (region III] quarters of S, 
respectively (Figs. 6 and 7). MAR mutants selected with the VN MAbs exhibited 
amino acid changes at a total of 16 positions, 7596 of these being in region I, the 
remainder in region III (Figs. 6 and 7). MAbs to site D, in addition to being VN, 
inhibited hemagglutination and selected MAR mutants of IBV-D207 (Kant et 
al., 1992) and IBV-M41 (Cavanagh et al, 1988) at positions 60-63 from the 
N-terminus of S1, including signal peptide. | 

The entire S protein gene of nine isolates of IBV has been sequenced, 
revealing that S1 can vary > fivefold more than S2 (references in Section II.A). 
The amino acid differences in S1 are not scattered randomly but tend to be 
concentrated in regions I and III (Fig. 7). Among 13 strains of one serotype 
(Massachusetts] there were more than fivefold more variable positions in region 
I than in region II (Fig. 7b) (Cavanagh et al., 1988, 19923). Since all these strains 
were of the same serotype, this shows that the variation in region I did not make 
a great impact on serotype. 

In a group of seven IBV isolates, including UK/6/82, and the D207 strains 
used by Koch and colleagues for MAR mutant studies, in which the maximum 
difference between any two strains was only 5% of S1 residues, 88% of the 
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FIGURE 7. Amino acid variaton of the S1 sunbunit of the IBV spike protein. The rectangles 
represent the first (amino acid residues 1-130], second (131-260), third (261-391), and fourth (392 to 
approximately 520} quarters of S1, signal peptide excluded. The numbers show the number of amino 
acid differences in each quarter for the groups of isolates specified. (a) Location of amino acid 
changes among 16 MAR mutants of IBV, strain D207 (Kant et al., 1992). (b) Location of amino acid 
differences among a group of 13 strains (excluding Beaudette) of the Massachussetts serotype 
(Cavanagh et al., 1988, 1992b). (c) Location of amino acid differences among a group of seven 
isolates, including D207 and UK/6/82, where the maximum difference between two isolates is 596 
(Cavanagh et al., 1992a]. (d) Number of amino acid differences between members of the UK/6/82 
group and the Massachussetts M41 strain, the difference between M41 and individual members 
of the UK/6/82, group being approximately 2096. 


differences were distributed approximately equally among regions I and III (Fig. 
7c) (Cavanagh et al., 1992b). Comparison of the MAR mutant and field isolate 
data shows that the location of changed residues in MAR mutants is very 
similar to that of the amino acid differences among field isolates, indicating 
that many of the differences in regions I and III of field isolates are associated 
with differences in antigenicity. Indeed, despite their overall high amino acid 
identity, these seven strains represented three serotypes, supporting the view 
that changes in these regions of S1 can be advantageous to the virus, in respect 
to avoidance, if only partial, of immune responses induced in chickens by prior 
infection. A comparison of Fig. 7a-c suggests that changes in region III may 
contribute more to the serotype than region I. The observation that isolates that 
differed by only a dozen or fewer amino acids in S1 exhibited very little cross- 
neutralization suggests that most of the VN antibody was induced by a small 
number of immunodominant epitopes. 

As the differences between serotypes rise to about 2096 of S1 residues, e.g., 
when the UK/6/82, group is compared with the Massachusetts M41 strain, the 
number of amino acid differences in regions II and IV does increase, but there is 
also a marked increase in the number of changes in regions I and III (Fig. 7d) 
(Cavanagh et al., 1992a; Cavanagh and Davis, 1992). To what extent changes in 
regions II and IV contribute to antigenic variation is not known. 

Recently, the sequence of S1 of several of the “classical” North American 
reference isolates has been determined, including the Arkansas 99 (Ark99) 
isolate (Wang et al., 1993; Jia et al., 1993a), SE17 and Ppl4 (Wang et al., 1993), 
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Gray (Kwon and Jackwood, 1993; Wang and Collisson, 1993), and JMK (Kwon 
and Jackwood, 1993), Holte (Wang and Collisson, 1993), CU-T2 (Jia et al., 19933), 
Iowa-609 (Wang et al., 1994) and parts of S1 for the Connecticut-46 and Florida 
isolates and two strains recently isolated in Delaware (J. Gelb, C. Keeler, and 
W.A. Nix, personal communication). This has revealed that some of these 
strains differ substantially from each other (about 2096 of S1 residues], while 
other isolates have high identity (79596). The Gray and JMK strains share 98% 
identity, consistent with them being placed in the same serotype (Hopkins, 
1974). However, Hopkins, unlike Johnson and Marquardt (1975), placed the 
Holte strain in the same serotype as JMK, whereas they differ by about 2796 in 
the first 200 or so amino acids of S1. Similarly, Johnson and Marquardt consid- 
ered that the Ark99 and Connecticut-46 were different serotypes, consistent 
with the finding that they differ by 29% in the first 200 residues, whereas 
Hopkins placed them in the same serotype. These results further indicate that 
serology is not a firm foundation on which to build a system for establishing 
relationships between IBV isolates. 

The first 350 or so residues of S1 of many Australian strains, isolated over a 
30-year period, have recently been determined (S. Sapats, P. J. Wright, and Y. 
Ignjatovic, personal communication). Comparisons show that many isolates 
obtained in the 1960s and 1970s had about 80% identity with each other and 
with several European and North American strains in this part of S1. Most 
interestingly, however, two strains isolated in different parts of Australia in the 
same year (N1/88 and Q3/88) had only 6796 identity with each other over the 
whole of S1 and had percentage differences in the first 350 residues of about 5596 
(suggesting a possible difference of about 6796 for the whole of S1) when com- 
pared with earlier Australian isolates and with non-Australian strains. Only 
vaccines based on Australian isolates have ever been permitted in Australia, 
and importation of live poultry was banned 50 years ago. Thus Australian 
strains of IBV may have been evolving independently, at least for the lifetime of 
the intensive poultry industry. 

Another contributor to IBV variation is recombination during mixed infec- 
tion, and there is sequence evidence that indicates that recombination has 
occurred within the S gene (Kusters et al., 1989a, 1990; Cavanagh et al., 1992a, 
Wang et al., 1993; Jia et al., 1993b). One consequence is that characterizing an 
IBV isolate by partial sequencing of S1 may not always result in the establish- 
ment of a more accurate relationship between two strains than has been 
achieved by serological analysis. Where it is important to establish the degree of 
relatedness of strains it would be advisable to sequence more than one part of 
S and also part of another gene. | 


IV. INDUCTION OF PROTECTIVE IMMUNITY 


There is strong evidence that the S protein is a major inducer of protective 
immune responses, although the other structural proteins undoubtedly also 
play a role (see Chapters 6 and 8, this volume). C57 BL/10 strain mice were 
protected from a lethal infection of MHV-3 by immunization with sucrose 
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gradient-purified S (Hasony and Macnaughton, 1981), as were BALB/C mice 
immunized with affinity-purified S and challenged with MHV-A59 (Daniel and 
Talbot, 1990], although there were some clinical signs. Some anti-S MAbs also 
gave a measure of protection against MHV (Nakanaga et al 1986; Koolen et al., 
1990). When the MHV-A59 S protein was expressed in mice using an adenovirus 
vector, 50-90% of the mice, dependent on the challenge dose, were protected 
against the lethal effects of intracerebral A59 infection (Wesseling et al 1993]. 
A recombinant adenovirus expressing the N protein also induced protective 
immunity, the protection being greater when N and S were jointly expressed 
compared with either protein alone. A peptide corresponding to a nine amino 
acid sequence located 131 residues from the amino-terminus of S2 (Fig. 5) 
(Luytjes et al., 1989) induced protection in mice (Koolen et al., 1990). This 
sequence has also been expressed as a chimeric protein including the VP2 capsid 
protein of a human parvovirus, expressed by a recombinant baculovirus (Brown 
et al., 1994). Sixty percent of mice vaccinated with a chimeric protein and 
subsequently challenged with MHV A59 survived the challenge. 

IBV from which the S1 subunit had been removed by urea was unable to 
induce immunity to respiratory infection in chickens (Cavanagh et al 1986c), 
indicating that the S protein, and possibly the S1 subunit in particular, was 
required for immunity. Chickens vaccinated with live vaccine of one IBV 
serotype are frequently poorly protected against challenge by other serotypes 
(Johnson and Marquardt, 1975). Since the S protein varies to a greater extent 
than the other virion proteins and given the experimental evidence for a major 
role in protection played by the S protein, it is likely that poor cross-protection 
is due in large measure to differences in the epitopes of the S protein. 

Epitopes on the S protein of TGEV are more immunogenic in pigs than 
epitopes on the N and M proteins and are considered to be of major importance 
in the induction of lactogenic immunity (Diego et al., 1992). Vaccination of cats 
against FIPV can be disadvantageous since antiviral antibodies lead to enhance- 
ment of infectivity of FIPV for monocytes and macrophages, leading to death of 
the animal (early death syndrome]. That the S protein is involved has been 
shown by vaccinating cats with a vaccinia virus recombinant expressing the 
FIPV S protein; the cats succumbed earlier than controls after challenge with 
FIPV (Vennema et al., 1990a,d). 


V. BIOLOGICAL FUNCTIONS 
A. Primary Attachment to Cells 


This section concerns the virus components that mediate attachment to 
cells; cell receptors for coronaviruses are discussed in Chapter 4. It is generally 
assumed that the attachment of coronaviruses to cells is mediated by the S 
protein (Collins et al., 1992), especially in the absence of the HE protein. Many, 
although not all, VN MAbs are directed against S (see Section M). Some corona- 
viruses that do not have a HE protein can cause hemagglutination (HA). IBV can 
cause HA, although this is inefficient. In the past, manifestation of the HA 


CORONAVIRUS SURFACE GLYCOPROTEIN 95 


capacity of IBV has involved concentration of the virus and treatment with a 
crude preparation of Clostridium welchii phospholipase C type I (Bingham et 
al., 1975; Cavanagh and Davis, 1986]. A recent study has shown that treatment 
of IBV with purified neuraminidase (from Vibrio cholerae and from Newcastle 
disease virus) efficiently converts IBV into good HA material, indicating that it 
was the presence of small amounts of neuraminidase that had been effective in 
the crude bacterial extracts (Schultze et al., 1992). Neuraminidase treatment of 
erythrocytes abolished HA with neuraminidase-treated IBV. This indicates that 
sialic acid (a2,6 linkage) on the erythrocytes was acting as a receptor. It is 
unclear why neuraminidase treatment of the virus enhanced HA. One reason 
may be that cellular material was attached to the S protein, via a sialic acid link, 
and that the neuraminidase "cleaned" this off. That it is the S protein that is 
responsible for attachment of IBV has been shown in a number of ways. Some 
MAbs reactive with S (Mockett et al., 1984) and polyclonal antibody raised 
against purified S (Cavanagh et al., 1984) inhibited HA. Moreover, the HA- 
inhibiting antibody was associated with the S1 subunit (Mockett et al., 1984; 
Cavanagh et al., 1986c], virus lacking S1 could not cause HA and was noninfec- 
tious (Cavanagh et al 1986c) and the HA-inhibiting MAbs also had VN activity. 
Porcine TGEV that, like IBV, does not have an HE protein can also cause HA 
(Noda et al., 1987, 1988]. 

The HE protein, present in a subset of the coronaviruses, can also attach to 
erythrocytes (see Chapter 8, this volume]. Schultze et al. (1991) have purified 
the S and HE proteins of BCV. Both proteins recognized the same type of sialic 
acid receptor, but the S protein caused HA more efficiently than did HE. They 
proposed that S was responsible for the primary attachment of the virus to cells. 


B. Membrane Fusion 


Great care must be taken when generalizations are made about corona- 
viruses, not least where membrane fusion is concerned. It can be stated that 
S-induced cell-cell fusion (syncytium formation) does not have an absolute 
requirement for cleavage of S. Indeed, the S protein of members of the TGEV 
group is not cleaved but can induce syncytia. For those coronaviruses that do 
produce a cleaved S, prevention of cleavage does not necessarily result in the 
loss of fusion activity, although it results in less efficient fusion induction. No 
single sequence solely responsible for fusion has been identified. Indeed, the 
evidence to date is that several regions, in terms of the linear S molecule, 
influence fusogenic activity. With regard to pH, the spike protein of many 
strains of all coronavirus species so far examined will induce cell-cell fusion 
optimally at neutral pH and slightly above. For these viruses, fusion of an 
incoming virion with a cell membrane is affected little or not at all by 
lysosomotropic agents, since there is no detrimental effect on the successful 
infection of cells. Since lysosomotropic agents raise the pH in endosomes away 
from acidity, this indicates that virion-cell fusion either occurs at the cell 
surface or within endosomes prior to their acidification. However, there are 
some very interesting exceptions to this general observation. 
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1. Evidence for the Role of S in Fusion 


Expression of the S gene of FIPV and MHV-JHM using virus vectors has 
provided unequivocal evidence that S alone among the virus proteins is re- 
quired for syncytium formation (de Groot et al., 1989; Pfleiderer et al., 1990; 
Grosse and Siddell, 1994). The identification of a fusion domain has not proven 
possible, given that sequences in both S1 and S2 glycopolypeptides are involved 
and that the three-dimensional structure of S has not been deduced. Bacterial 
expression products, containing nonconformational epitopes, corresponding to 
both S1 and S2 sequences have induced MAbs with fusion-inhibiting activity 
(Routledge et al 1991). A nine-residue peptide, corresponding to the sequence 
beginning 131 residues from the N-terminus of S2 (strain A59), binds a fusion- 
inhibiting site A MAb (Luytjes et al., 1989) (position 900-908 in Fig. 5a). 
Recombination experiments have also supported the view that S2 of MHV is 
involved in fusion (Keck et al., 1988) and S2 of BCV expressed in insect cells 
caused cell fusion (Yoo et al., 1991a). Amino acids in both of the hydrophobic 
regions of S2 that exhibit heptad repeats and are believed to form the coiled-coil 
structure of the oligomeric S protein have been associated with fusion activity 
(Fig. 5e). Gallagher et al. (1991) identified three residues, at positions 1067, 1094, 
and 1114 in the MHV4 S protein, which together were associated with the loss of 
fusion at neutral pH and a requirement of acidic pH, although it is not clear 
exactly how these changes cause such a marked effect on fusion (Fig. 5e]. A 
peptide corresponding to residues 1264-1276 of MHV4 induced MAb 10G, 
which had both VN and antifusion activity (Fig. 5) (Routledge et al 1991). In the 
same study a peptide of residues 33—40, i.e., near the N-terminus of S1, also had 
both of these activities. The anti-S2 MAb 10G might destabilize the oligomeric 
structure of S and thereby interfere with the interaction of the fusogenic region 
with a cell membrane. A similar explanation may apply to MAb 19.2, which 
inhibits fusion and binds to a linear sequence within antigenic domain III 
(Lutyjes et al., 1989; Daniel er al., 1993), although this is not located directly 
within the hydrophobic areas (Fig. 5). It must be remembered, however, that S 
protein-induced cell-cell fusion is at least a two-step process, the fusion event 
being necessarily preceded by the attachment of the S protein, anchored in the 
plasma membrane of a cell, to the surface of an adjacent cell. Cells that were 
expressing the S protein of FIPV or MHV, following infection with a vector 
carrying the relevant S gene, fused with cocultivated cells when those cells 
were of a type that were susceptible to FIPV or MHV infection, respectively, but 
not to cells that were refractory to infection with these viruses (de Groot et al., 
1989; Vennema et al., 1990b,c). This not only indicates that S must first attach 
to a receptor prior to fusion, but also that there is some specificity in that 
attachment. Taguchi et al. (1992) have shown, however, that vaccinia virus 
expression of MHV S resulted in fusion not only in murine cells but also in 
rabbit RK13 cells, which are refractory to infection but would appear to have a 
receptor for MHV, strain cl-2. It may be that MAbs that inhibit fusion might do 
so either by inhibiting attachment of S to cell membranes (where Scan be either 
on virions or on the surface on an infected cell, adjacent to a noninfected one] or 
the fusion event. Some site A and B MAbs of MHV neutralize virus in addition 
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to inhibiting fusion. The VN and antifusion MAb 11F that binds near the 
beginning of S1 (Fig. 5c) could interfere with the attachment process, thereby 
indirectly affecting fusion. Alternatively, residues 33-40 may be in close prox- 
imity to residues 1264-1319 in the native protein, this association being neces- 
sary for fusogenic activity; hence, this activity would be inhibited to antibodies 
against either linear location. A third possibility is suggested by the observation 
that MAR mutants selected by MAb 11F had mutations not in residues 33-40, 
which are known to have induced MAb 11F (Routledge et al., 1991), but in S2 
residues 1109, 1110, and 1116 (Grosse and Siddell, 1994). Sturman et al. (1990) 
have shown with MHV that mildly alkaline conditions result in the irreversible 
dissocation of S1 and S2. Furthermore, they have suggested that this event is 
necessary for fusion activity. It has been proposed (S. G. Siddell, personal 
communication) that MAb 11F might interfere with this dissociation process, 
given that the residues 33-40 would seem to be closely associated with residues 
1109-1116 in the mature S protein. It should be remembered that the S proteins 
of some coronaviruses, e.g., those of the TGEV group, are not cleaved into S1 
and S2 glycopolypeptides (Sections I.B.2 and IV.B.3). Therefore, if rearrange- 
ment of the N-terminal and C-terminal halves of S is a part of the fusion- 
activation process, this does not necessarily result in the total separation of S1 
and S2. Knowledge of the three-dimensional structure of S would greatly facili- 
tate interpretation of these data. 


2. Role of pH in Fusion 


There are two stages at which the fusogenic role of S may come into play, 
the first of which can be considered essential for natural infection. First, the 
virion membrane must fuse with a cell membrane to release the genome into 
the cytoplasm. Second, newly produced S protein may, on being transported to 
the plasma membrane, cause an infected cell to fuse to an adjacent, noninfected 
cell, thereby allowing spread of the infection without the absolute necessity for 
release of virions into the extracellular environment. Many investigations of 
the pH requirements for coronavirus fusion have focused on cell-cell fusion, to 
form syncytia, either by the addition of large amounts of virus to cells to give 
rapid syncytium formation [fusion from without (FFWO), a gross example of 
virion-cell fusion] or simply by observing syncytium formation after virus 
replication [fusion from within (FFWI)]. The virion-cell membrane fusion that 
results in genome uncoating has been studied by addition of lysosomotropic 
agents (ammonium chloride, chloroquine). These raise the pH in endosomes, 
the vesicles in which viruses become surrounded if they do not fuse at the cell 
surface, from «6.0 to almost 7.0. The rationale is that if infection is inhibited or 
greatly decreased by a lysosomotropic agent, this is indicative that an acidic 
environment is necessary for activation of the fusion protein. 

Observation of fusion induction by the S protein of MHV, FIPV, BCV, and 
IBV have shown that for many strains fusion (virion-cell and cell-cell) occurs 
optimally at about pH 7.0 or slightly greater. This has been shown by experi- 
ments on FFWO with MHV-A59 (Sturman et al., 1985; Frana et al 1985; Kooi et 
al., 1991) and on FFWI following MHV4 replication (Gallagher et al., 1991). 


98 DAVID CAVANAGH 


MHV-induced FFWI was inhibited when the culture medium had a pH «7.0 
(Sawicki and Sawicki, 1986). Similar observations have been made with BCV 
(Payne and Storz, 1988). Neutral pH was sufficient for FFWI following expres- 
sion of FIPV and MHV S protein from vaccinia virus vectors (de Groot et al., 
1989; Vennema et al., 1990b,c; Pfleiderer et al., 1990; Taguchi et al., 1992; 
Stauber et al 1993). Lysosomotropic agents had little effect on the infection on 
cells by BCV and IBV strain UK/123/82 (Payne and Storz, 1988; Li and Cavanagh, 
1990, 1992). Electron microscope observations showed fusion of BCV virions at 
the cell surface but not at internal membranes. 

Kryzstyniak and Dupuy (1984) reported that infection by MHV3 was 
greatly diminished by lysosomotropic agents. Mizzen et al. (1985) found that 
ammonium chloride did not decrease the number of cells infected but did 
extend the eclipse phase. That is, in the presence of the agent, virus was 
transferred from the cell surface (as shown by resistance of proteinase-K treat- 
ment of the cells) and remained as infectious virus within the cells for an 
appreciably longer time than in the absence of ammonium chloride. However, 
even in the presence of ammonium chloride the cells ultimately became in- 
fected, and virus yields were the same as the controls. Kooi et al. (1991) have 
confirmed that ammonium chloride delays the appearance of new virus by 
about 4 hr and that the final titers were the same as in the controls. In parallel 
experiments the titer of vesicular stomatitis virus, a virus known to require an 
acidic pH for fusion, was greatly reduced in the presence of the lysosomotropic 
agent. These authors also showed that MHV-A59 was endocytosed by Vero and 
rat astrocytoma (C6) cells, but this did not lead to productive infection. The 
latter did occur after cells to which virus had attached were treated with 
polyethylene glycol to cause fusion at the cell surface. This agreed with the 
earlier work of Van Dinter and Flintoff (1987). The interpretation of these 
results is that although MHV-A59 can be endocytosed, this may result in 
transfer of virus to lysosomes, where it is destroyed. Rather, it is considered that 
the fusion of MHV-A59 virions with cells is not dependent on endocytosis and 
certainly not on an acidic environment, and occurs at the cell surface or pos- 
sibly in endocytic vesicles and/or endosomes prior to acidification. 

There are, however, exceptions to these observations. The productive in- 
fection of Vero and CK cells by IBV-Beaudette (in contrast to the UK/123/82 
strain) was reduced by 80-95% when ammonium chloride was present early in 
infection, with a corresponding 0.9 log,, decrease in subsequent virus produc- 
tion by 10 hr postinfection, near the end of the first replication cycle (Li and 
Cavanagh, 1990, 1992). Syncytium formation by FFWI occurred optimally at pH 
6.7. These results suggest that IBV-Beaudette virions fuse shortly after transfer 
to endosomes in a slightly acidic environment. Electron microscope investiga- 
tions of the entry of IBV-Beaudette into chorioallantoic cells and CK cells did 
not reveal any fusion of the virus at the cell surface but did observe entry by 
endocytosis (Chasey and Alexander, 1976; Patterson and Bingham, 1976). In 
contrast, Payne et al. (1990) observed that BCV fused with plasma membrane 
but not with endosomal membranes, consistent with the observation that 
ammonium chloride had little inhibitory effect on BCV infection. 

A more dramatic example has been described by Gallagher et al. (1991). 
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They showed that MHV4, in susceptible murine DBT and Sac-— cells, caused 
syncytium formation by FFWI at pH 5.5-8.5, the optimum being at pH 7.0 and 
above. Lysosomotropic agents did not reduce replication, as assessed by the rate 
of viral RNA synthesis, although there was a delay in the time taken for peak 
production in the presence of chloroquine. However, variants of MHV4 ob- 
tained from a persistently infected neural cell line caused little or no fusion in 
DBT or Sac- cells at pH 7.0 and above. Instead, the optimum pH for fusion was 
5.5-6.0. Correspondingly, productive infection by these mutants was inhibited 
substantially by lysosomotropic agents. The mutations corresponding to this 
changed phenotype were located in the larger of the heptad repeat areas of S2 
(Fig. 5e). Passage of the acid pH-dependent variants in Sac- cells resulted in 
selection of neutral pH-fusing revertants. Ammonium chloride and chloro- 
quine did not prevent replication of the revertants, although, as with MHV-A59 
(Mizzen et al., 1985; Kooi et al., 1991), they delayed virus replication by a few 
hours. 

The work of Gallagher et al. (1991) has shown that cell type was of para- 
mount importance in the selection of MHV4 variants with greatly differing pH 
requirements for fusion. One suggestion by Gallagher et al. (1991) as to why low 
pH-dependent virus was selected in OBL21A cells is that it may be unable to 
fuse at the cell surface but can fuse after transfer to endosomes, because of some 
difference in the membranes. It has been shown that the composition of the 
membrane, for example, with respect to cholesterol, can effect membrane 
fusion (Daya et al., 1988; Roos et al., 1990; Kooi et al., 1991). However, it does 
not necessarily follow that the spike protein gene should mutate to give a low 
pH-dependant protein. An alternative hypothesis (Gallagher et al., 1991) is that 
sustained replication in OBL21A neural cells might require that the incoming 
nucleocapsid becomes available deep within the cell and not just beneath the 
surface. This would be more likely to happen if the S protein required a low pH 
for fusion activation, achieved when the endosomes had traveled away from the 
cell surface. They further speculate that Sac— cells selected neutral pH- 
requiring revertants because syncytium formation is advantageous to the 
spread of the virus. 

In any event, the stage of nucleocapsid release from coronavirus virions is 
one that can be affected by cell type and may be a factor in determining host cell 
range and pathogenesis. Kooi et al. (1988) have further data that support the 
view that the degree to which cells are permissive to MHV can depend on the 
efficiency of the fusion process leading to the release of the nucleocapsid. 
Beushausen et al. (1987) have presented evidence that suggests that differenti- 
ated rat oligodendrocytes are resistant to MHV infection because normal pro- 
cessing of the phosphorylated nucleocapsid protein is inhibited in some 
manner. 


3. Requirement of Cleavage for Fusion 


Whether or not cleavage of the coronavirus S protein is required for its 
fusogenic activity has been studied in several ways, including the simple expe- 
dient of determining whether each coronavirus had cleaved or uncleaved S in its 
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virions. Experiments involved FFWO and FFWI, with and without trypsin 
treatment, and expression of the S protein using virus vectors, with and without 
mutant S genes. Some coronaviruses produce virions with up to 10096 cleaved 
S, while others have never been observed to have cleaved S (see Section LC 
Virions of both types are infectious, showing that cleavage is not essential for 
virus-cell fusion. However, several studies during the 1980s indicated that only 
cleaved S would induce fusion of one cell with another to produce syncytia. 

MHV-A59 from Sac- cells, which had 100% of S in the cleaved form, was 
able to cause FFWO when added to L2 cells (Frana et al 1985). In contrast, 
MHYV-A59 virus from 17Cl 1 cells, which had about 50% cleaved S (Frana et al., 
1985), was unable to cause FFWO unless it had previously been treated with 
trypsin to complete the cleavage process (Sturman et al., 1985). Similarly, BCV- 
L9 only caused FFWI and plaque formation when trypsin was used to increase 
the proportion of cleaved S from 30-60% to almost 100% (Storz et al., 1981; 
Payne and Storz, 1988; Cyr-Coats et al., 1988). A variant of MHV-A59, which 
had no cleaved S, produced only small plaques and also did not cause cell fusion 
(Sawicki, 1987). IBV-Beaudette-induced FFWI of Vero cells was increased if 
trypsin was present, the effect being most marked with virus that had not been 
previously adapted to growth in Vero cells. Plaque development was also en- 
hanced (D. Li and D. Cavanagh, unpublished observation]. However, virions of 
both no, low, and high Vero cell-passaged virus had the same proportion (ap- 
proximately 1:2) of uncleaved to cleaved S. 

Initially these experiments were interpreted as showing that for viruses 
that had a spike that was capable of being cleaved into S1 and S2, cleavage was 
necessary for fusogenic activity leading to syncytium formation. An alternative 
interpretation is that, for these coronaviruses, both cleaved and uncleaved S has 
fusogenic activity but that cleaved S is more fusogenic. In those cases where 
addition of trypsin enhanced cell-cell fusion, it would then be concluded that 
prior to the addition of trypsin there was not enough S at the cell surface capable 
of efficiently forming sufficient microfusion events that would ultimately co- 
alesce to form syncytia. Extraneous trypsin cleaved the uncleaved S molecules, 
thereby increasing the number and concentration of efficiently fusogenic S 
molecules to above a threshold value, gross fusion then occurring. 

This alternative view arises in part from experiments in which S proteins 
have been expressed using virus vectors. A cleaved form of the spike of FIPV has 
never been observed. When the FIPV S protein was expressed using a vaccinia 
virus recombinant, syncytium formation resulted (de Groot et al., 1989]. To 
determine if uncleaved S of MHV-JHM would induce syncytia, Stauber et al. 
(1993) and Taguchi (1993) mutated the highly basic S1-S2 connecting peptide 
(Arg-Arg-Ala-Arg-Arg) to Ser-Val-Ser-Gly-Gly and Arg-Thr-Ala-Leu-Glu, re 
spectively. When expressed, the mutant spikes were not cleaved, but they did 
induce cell-cell fusion. Taguchi (1993) observed a 2- to 4-hr delay in fusion in 
the cells expressing the mutant S, suggesting that although uncleaved S was 
fusogenic, it was perhaps less efficient than cleaved S. 

Additional support for this conclusion was provided by Gombold et al. 
(1993), who examined mutant MHV-A59 isolated from productively but persis- 
tently infected primary mouse glial cell cultures. The mutants grew normally 
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in L2 cells, but, whereas wild-type virus had caused almost 100% fusion of the 
monolayer by 10 hr after infection, the mutants had caused less than 1096 
fusion. Reduction of the pH had no enhancing effect on fusion, but trypsin did 
do so for several mutants, which in the absence of the enzyme produced only 
uncleaved S. Sequencing revealed that the mutants had the connecting peptide 
Arg-Arg-Ala-Asp-Arg instead of the wild-type Arg-Arg-Ala-His-Arg, i.e., a nega- 
tively charged aspartic acid residue in place of a weakly basic histidine. Anal- 
ysis of revertants, which produced cleaved S and fused cells efficiently, showed 
that all the revertants had lost the aspartic acid residue in the connecting 
peptide, it having been replaced by the wild-type histidine in one case but by the 
small, uncharged residues alanine or glycine in others. These results showed 
that the presence of a negatively charged residue one position away from the 
C-terminal end of the S1-S2 connecting peptide prevented cleavage; that a basic 
residue at this position was not essential for cleavage; and that while cleaved S 
was not required for virus—cell fusion and was not essential for cell-cell fusion, 
uncleaved S was less efficient at inducing cell-cell fusion. The observation that 
virions of MHV and BCV with 50% or less cleaved S are infectious also shows 
that cleavage of all the S molecules is not necessary for the fusion between 
virion and cell membranes that results in genome uncoating. 

The seemingly greater capacity of the S protein to fuse virions with cell 
membranes may be because the concentration of S molecules in the virion 
envelope may be greater than at the surface of an infected cell. If this is the case, 
then virions would be expected to cause more microfusion events, in close 
proximity to each other, resulting in fusion. Syncytium formation is a rather 
gross manifestation of S-induced membrane fusion, a process that is probably 
preceded by small, localized fusion events between adjacent cells. If this is the 
case, then it may be the progression from the localized to the extensive fusion 
that requires a particularly high density of cleaved S. 

In conclusion, the process by which the coronavirus S protein induces 
membrane fusion would seem to be a complex, stepwise one. Interaction of the 
S protein with a cell surface may require primary and secondary receptors. This 
attachment process may induce conformational changes that ultimately result 
in gross changes in the juxtaposition of S1 and S2 sequences, followed by 
membrane fusion. Protonation is not required in most cases. 


VI. ROLE OF S IN PATHOGENICITY 


It is likely that the coronavirus spike glycoprotein is a major determinant 
of cell tropism and hence of pathogenicity. However, in very few, if any, cases 
can it be definitively stated that a change in the S protein was responsible for a 
different biological behavior. In some mutants, genes encoding other proteins 
are known to have changes, while in all mutants there may well be mutations 
in other genes that have not been detected. Infectious DNA copies of corona- 
virus genomes will be required to obtain unequivocal answers about the role of 
the S protein in the varied pathogenicity of coronaviruses. However, it is 
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worth highlighting those cases in which changes in virulence or tropism have 
been paralleled by changes, sometimes very marked ones, in S. 

Several members of the TGEV group exhibit >90% amino acid identity in 
the S protein except for the first 200-300 residues, which are either missing or 
have identities as low as 30% (Fig. 3; Section III B 1). The most stiking example, 
perhaps, is that of TGEV compared with its mutant PRCV, with their tropisms 
for the alimentary and respiratory tracts, respectively. From the S gene to the 
poly-A tract there is 9896 identity overall, except that PRCV lacks the first 224 
amino acids of the TGEV spike. It is very tempting to believe that this deletion 
is responsible for the different tropisms of these viruses, one of which (PRCV} 
causes mild, sometimes inapparent, disease, while the other (TGEV]) is fre- 
quently lethal for piglets. However, it is known that there are a few differences 
elsewhere in the genome, including some in gene 3 that have effectively re- 
sulted in the loss of the 3a open reading frame. 

Variation within S of FIPV may be responsible for differences in patho- 
genicity. Avirulent strains of FIPV were internalized less rapidly than virulent 
strains, from which they could be distinguished by anti-S MAbs (Fiscus and 
Teramoto, 1987b). 

Likewise, the hypothesis that many of the different tropisms exhibited by 
MHV strains and other similar coronaviruses are the result of differences in S is 
very attractive. In particular, the hypervariable region of S1 (Fig. 5) is a prime 
candidate for a determinant of pathogenicity. Strains of MHV cause a broad 
spectrum of CNS diseases in rodents as well as replicating in various non-CNS 
tissues (Matsubara et al., 1991). Many of the neuroattenuated variants have 
arisen during passage in tissue culture. The MHV4 strain may be considered as a 
prototype MHV strain. Passage in tissue culture has resulted in the loss, for 
example, of 141 amino acids from the hypervariable region of S1 to generate the 
JHM strain (Fig. 5), which prior to sequencing had been considered to be identi- 
cal to MHV4 (Baybutt et al., 1984; Parker et al., 1989; Gallagher et al., 1990). 

Interestingly, analysis of MHV-JHM mRNA obtained from the brains of 
Lewis rats at 5 to 7 days after intracerebral inoculation showed that the S 
mRNA was larger than that of the input virus (Taguchi et al., 1985). Virus 
isolated at the same time produced the larger S mRNA during replication in 
vitro, and produced a correspondingly enlarged S protein. It is likely that the 
inoculum was a mixed population, a small proportion of which had a “full-size” 
S protein very similar to the MHVA virus from which the JHM strain had been 
derived. Back-passage of the virus in rats then selected the virus with the full- 
length S. 

Morris et al. (1989, 1990} isolated variants from the CNS of Wistar Furth 
rats after infection with MHV-]HM, and two of these variants have been se- 
quenced by La Monica et al. (1991). One variant lacked the codons encoding the 
C-terminal 246 amino acids of the HE protein and had a 147 amino acid deletion 
in S1 (286 residues when compared with MHV4) (Fig. 5b). This variant caused 
demyelination in both 2- and 10-day-old rats, whereas the wild-type virus only 
produced this in the 10-day-olds, causing an acute encephalitis virus in 2-day- 
old rats. Another variant, which induced acute encephalitis in rats of both ages, 
did not have the deletion in S but did have a deletion in the HE gene. 
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A neural cell line persistently infected with MHV4 selected variants that 
had a dependence on low pH for membrane fusion, in contrast to wild-type 
virus (Gallagher et al., 1991). Three mutations responsible for this were identi- 
fied in S2 (Fig. 5e). The authors suggested that this would favor virus fusion in 
endosomes rather than at the cell surface, releasing the nucleocapsid deeper 
within the cell, this being required for some unknown reason. Revertants were 
selected when the mutants were back-passaged in Sac- cells; the revertants, 
unlike the mutants, caused extensive syncytia formation. This enables virus to 
spread without the necessity of being released from cells; this may be an 
advantage in some instances. Different, fusion-related mutations were identi- 
fied by Gombold et al. (1993) after MHV-A59 had established productive persis- 
tent infection in mouse glial cells. The mutants had an altered S1-S2 connecting 
peptide and the spike was not cleaved and did not cause cell-cell fusion. Thus 
the spread of such mutants was limited to that attained by virions released from 
cells. A related example of a change in biological properties of MHV coincident 
with a change in S has been described by Fazakerley et al. (1992). A MAR mutant 
that had a 149 amino acid deletion in S1 had a neuroattenuated phenotype, 
considered to be a result of its reduced rate of spread in the CNS. Some other, 
similar deletion mutants have been selected from MHV4 stocks by VN MAbs. 
In four cell lines several such mutants had enhanced growth compared with the 
wild-type virus, attributed to delayed or diminished cytopathic effect, allowing 
cultures to support virus production for longer periods (Gallagher et al., 1990). 

Two MAb-selected variants of MHV-3 had a changed tissue tropism, the 
mutants accumulating more in the lung and kidneys rather than in the liver, the 
primary target organ of the wild-type virus (Martin er al., 1990). 

The S protein can affect the outcome of infection in other ways. MHV-JHM 
spike on the surface of infected cells can bind IgG via the Fc domain (Oleszak 
and Leibowitz, 1990). MHV-infected cells were lysed in the presence of spleen 
and lymph node cells from nonimmunized mice, leading to the proposal that S 
on the cell surface had, at neutral pH, caused fusion of the infected cell and 
those of the B cell lineage (Wysocka et al., 1989). 
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CHAPTER 6 


The Coronavirus Membrane 
Glycoprotein 


PETER J. M. ROTTIER 


I. INTRODUCTION 


Coronaviruses have a simple protein composition. While there is some varia- 
tion among different members, a basic set of four protein species universally 
occurs: the nucleocapsid protein INL the spike protein (S), a small membrane 
protein (SM), and the membrane glycoprotein (M). Some coronaviruses have an 
additional membrane glycoprotein (HE). The M protein, previously also called 
El, is the subject of this chapter. As will become clear, M is a peculiar glycopro- 
tein, different from all other viral glycoproteins in its structural and biochemi- 
cal features. These unique features may be responsible for important biological 
properties of coronaviruses, in particular for their intracellular budding. 

M is the most abundant virion protein. In murine hepatitis virus (MHV) it 
was estimated by isotopic labeling to comprise some 40% of the particle's 
protein mass, exceeding the nucleocapsid protein on a molar basis at a ratio of 
2:1 (Sturman et al., 1980). A similar ratio was obtained from incorporation 
studies with avian infectious bronchitis virus (IBV) (Stern et al., 1982), while 
equimolar ratios were determined for bovine coronavirus (BCV) (King and 
Brian, 1982) and human coronavirus (HCV] OC43 (Hogue and Brian, 1986]. In 
the latter case, M and N were calculated to be present at a rate of 726 molecules 
per virion. 

The M gene, together with the genes for the other structural proteins, is 
located in the 3' one third of the coronaviral genome, downstream from the 
spike protein gene and upstream from the nucleoprotein gene. As a conse- 
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quence of the specific mode of transcription of coronaviruses, the M protein is 
expressed from a mRNA that, in addition to the M sequence, carries extra 
genetic information including the N sequence. This additional information is 
located 3' from the M gene and is functionally redundant; the M protein is 
equally well translated from a mRNA derived from a cloned copy of the M gene 
(see Machamer and Rose, 1987; Mayer et al., 1988; Rottier and Rose, 1987}. 


II. PHYSICOCHEMICAL PROPERTIES 
A. Covalent Modifications 


The M protein is usually found as a family of differentially glycosylated 
proteins, including the unglycosylated precursor. These proteins span a M, 
range of 20 to 38kDa. The M protein of a particular coronavirus carries either N- 
or O-linked oligosaccharides (Table I). The only other modification identified so 
far is the addition of sulfate (Garwes et al., 1976), but it is unknown whether 
this constituent is attached to the oligosaccharide side chains or bound directly 
to the polypeptide through tyrosine. M is not acylated (Niemann and Klenk, 
1981; Schmidt, 1982), nor does it contain phosphate (e.g., Stohlman and Lai, 
1979; Rottier et al., 1981a; King and Brian, 1982; Hogue and Brian, 1986). 

The structures of the N-linked oligosaccharides carried by coronavirus M 
proteins have not been studied in detail, but are probably not different from 
those found in other viral and cellular N-glycosylated proteins. Carbohydrates 
bound to a polypeptide through O-linkage to serine or threonine residues are 
quite uncommon among viral proteins. Of the O-glycosylated coronaviral M 
proteins, the side chains of the M protein of MHV strain A59 grown in 17C11 
cells, a spontaneously transformed BALBC/3T3 line, have been analyzed 
(Niemann and Klenk, 1981; Niemann et al., 1984). Two size classes of oligosac- 
charides were released from the protein by p-elimination. Their structures, as 


TABLE I. Type of Glycosylation 
of M Proteins of Coronaviruses 


Common name Designation 
N-glycosylation 
Canine coronavirus CCV 
Feline infectious peritonitis virus FIPV 
Feline enteric coronavirus FECV 
Human coronavirus strain 229E HCV-229E 
Infectious bronchitis virus IBV 
Turkey coronavirus TCV 
Transmissible gastroenteritis virus TGEV 
O-glycosylation 
Bovine coronavirus BCV 
Diarrhoea virus of infant mice DVIM 
Human coronavirus strain OC43 HCV-OC43 


Mouse hepatitis virus MHV 
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FIGURE 1. Structures of the O-linked oligosaccharide side chains of the MHV-A59 M glycoprotein. 










determined by a combined gas chromatographic and mass spectrometric anal- 
ysis, are shown in Fig. 1. The branched form was most abundant, comprising 
about 65% of the carbohydrate structures. Both forms were attached to the M 
polypeptide via N-acetylgalactosamine. Of the sialic acids (N-acetylneuraminic 
acid, Neu5Ac), some 20% were identified as the O-acetylated derivative 
Neu5,9Ac, (Niemann et al., 1984). 

The oligosaccharides are attached to the N-terminal region of the M mole- 
cule and exposed at the virion surface. Treatment of MHV (Sturman, 1977; 
Sturman and Holmes, 1977; Rottier et al 1984) and BCV (King and Brian, 1982) 
with various proteases removed some 5kDa from the polypeptide, including all 
the O-linked sugars. Similarly, the N-linked oligosaccharides of the IBV M 
protein were all detached by hydrolysis of the exposed domain of the molecule 
(Cavanagh, 1981; Cavanagh et al., 1986a). Using [*°S|formyl-methionine to ter- 
minally label the polypeptide, this ectodomain was identified as the N-ter- 
minus (Stern et al., 1982; Rottier et al., 1984). 


B. Solubility 


The M protein has unusual solubility properties. This was demonstrated 
for MHV M in the pioneering studies of Sturman and Holmes (Sturman, 1977, 
1981; Sturman and Holmes, 1977; Sturman et al., 1980). When isolated at 4°C 
from virions after nonionic detergent solubilization of the viral membrane, the 
protein formed globular, irregular aggregates of various sizes. Raising the tem- 
perature to 37 °C induced a conformational change in the molecules that led the 
aggregates to associate with the nucleocapsid through interaction with the viral 
RNA (Sturman et al., 1980). Thermosensitivity of the M protein was also 
prominent in sodium dodecyl sulfate (SDS) solution: heating from 25°C or 
37 °C, in which range the protein is in a monomeric state, to 100°C resulted in 
the formation of various self-aggregates, an effect that was even more pro- 
nounced in the presence of reducing agents (Sturman, 1977; Sturman and 
Holmes, 1977). Unlike the M protein of the MHV-A59 strain, the M protein of 
MHV-JHM additionally appeared to form heterogeneous complexes with the S 
glycoprotein under these conditions (Wege et al., 1979; Siddell et al., 1981). The 
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tendency of the MHV M protein to aggregate was also observed after various 
other denaturing treatments (Sturman, 1977). Apparently, any condition that 
promotes the unfolding of the M protein can expose a (probably hydrophobic) 
domain of the molecule which then engages in interactions. 

Aggregation of M in SDS has been described for a number of other corona- 
viruses such as hemagglutinating encephalitis virus (HEV) (Callebaut and Pen- 
saert, 1980), HCV-OC43 (Schmidt and Kenny, 1982; Hogue and Brian, 1986), and 
BCV (Niemann and Klenk, 1981; Deregt et al 1987). On the other hand, it is not 
a universal feature as the effect was not observed after heat denaturation of the 
M polypeptide of IBV (Cavanagh, 1981; Stern and Sefton, 1982b) or HCV-229E 
(Schmidt and Kenny, 1982], not even in the presence of 2-mercaptoethanol. The 
reason for this variable behavior is not understood. The effect seems to be 
independent of the degree and type of glycosylation. Even complete removal of 
the N-terminal ectodomain did not abolish the aggregation properties of the 
polypeptide (Sturman, 1977). 


III. PROTEIN STRUCTURE 
A. Primary Structure 


During the past years, the M genes of several coronaviruses have been 
sequenced. The deduced amino acid sequences of a number of M proteins are 
compiled in the alignment presented in Fig. 2. The primary structures are 225- 
230 amino acids long, with the exception of the transmissible gastroenteritis 
virus (TGEV) and feline infectious peritonitis virus (FIPV) sequences which 
amount to 262 residues, due to an extension at the NH,-terminus. The polypep- 
tides are slightly basic with net charges at neutral pH ranging from +4 
(HCV-229E) to +9 (BCV). Their cysteine content is quite variable, with only two 
such residues in the HCV-229E sequence and nine residues in IBV M. These 
numbers do not explain the effects of reducing agents on the solubility of the 
coronavirus M proteins. 

Pairwise sequence comparisons support the long-standing classification of 
coronaviruses made on the basis of antigenic relationships (McIntosh et al., 
1969; Bradburne, 1970; Pedersen et al., 1978; Horzinek et al., 1982). The M 
proteins of MHV and BCV, viruses of the same antigenic subgroup, are very 
closely related (8696 identity], but differ largely from all the others (e.g., MHV/ 
IBV 29%, MHV/HCV-229E 32%, MHV/FIPV 36%). Similarly, the M sequences 
of TGEV and FIPV, also antigenically related viruses, show a strong homology 
(84% identity), but are only distantly related to the others (e.g., T'GEV/IBV 
1796). The avian IBV M polypeptide has only low homologies to the mammalian 
proteins (e.g., IBV/FIPV 20% identity], in agreement with it being classified 
separately. HCV-229E has previously been placed into the TGEV group on the 
basis of weak serological cross-reactivities (Pedersen et al., 1978; Macnaughton, 
1981). The M protein of this virus has only little sequence similarity with the 
TGEV or FIPV protein (e.g., HCV-229E/TGEV 44% identity) or with any of the 
other M proteins. The same holds true for the S proteins (Wesseling et al., 1994). 
The data therefore support the recent proposition by Sanchez et al. (1990) to 
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MHV- — Á— ese wud uxo iau weder vehe did .MSSTTQAPE PVYQWTADEA VQFLKEWNFS LGIILLFITI 
BCV- EE EE ..MSSVTTPA PVYTWTADEA IKFLKEWNFS LGIILLFITI 
HOV, — Á—  LLCL[L¿L@àabhaw2mauss arwa EE MSNDNCTGDI VTHLKNWNFG WNVILTIFIV 
FBV- —— EECH ..MPNETNCT LDFE....QS VQLFKEYNLF ITAFLLFLTI 
FIPV MKYILLILAC IIACVYGERY CAMQ.DSGLQ CINGTNSRCQ TCFE..RGDL IWHLANWNFS WSVILIVFIT 
TGEV MK.ILLILAC VIACACGERY CAMKSDTDLS CRNSTASDCE SCFN..GGDL IWHLANWNFS WSIILIVFIT 
Consensus  ---------- ---------- ------.--- nee eee -- F----- D- ---L--WNF- --IIL----- 
71 140 
MHV ILQFGYTSRS MFIYVVKMII LWLMWPLTIV LCIFN..CVY ALNN.VYLGF SIVFTIVSIV IWIMYFVNSI 
BCV ILQFGYTSRS MFVYVIKMII LWLMWPLTII LTIFN..CVY ALNN.VYLGF SIVFTIVAII MWIVYFVNSI 
HCV ILQFGHYKYS RLFYGLKMLV LWLLWPLVLA LSIFDTWANW D.SNWAFVAF SFFMAVSTLV MWVMYFANSF 
IBV ILQYGYATRS KVIYTLKMIV LWCFWPLNIA VGVIS..CTY PPNTGGLVA. AIILTVFACL SFVGYWIQSI 
FIPV VLQYGRPQFS WLVYGIKMLI MWLLWPIVLA LTIFNAYSEY QVSRYVMFGF SVAGAVVTFA LWMMYFVRSV 
TGEV VLQYGRPQFS WFVYGIKMLI MWLLWPVVLA LTIFNAYSEY QVSRYVMFGF SIAGAIVTFV LWIMYFVRSI 
Consensus ILQ-G----S -FVY--KM-I LWLLWP---- L-IF----- Y --------- F SI-------- -WI-YFV-SI 
141 210 
MHV RLFIRTGSWW SFNPETNNLM CIDMKGTVYV RPIIEDYHTL TATIIRGHLY MQGVKLGTGF SLSDLPAYVT 
BCV RLFIRTGSWW SFNPETNNLM CIDMKGRMYV RPIIEDYHTL TVTIIRGHLY MQGIKLGTGY SLSDLPAYVT 
HCV RLFRRARTFW AWNPEVNAIT VTTVLGQTYY QPIQQAPTGI TVTLLSGVLY VDGHRLASGV QVHNLPEYMT 
IBV RLFKRCRSWW SFNPESNAVG SILLTNGQQC NFAIESVPMV LSPIIKNGVL YCEGQWLAKC EPDHLPKDIF 
FIPV QLYRRTKSWW SFNPETNAIL CVNALGRSYV LPLDGTPTGV TLTLLSGNLY AEGFKMAGGL TIEHLPKYVM 
TGEV QLYRRTNSWW SFNPETKAIL CVSALGRSYV LPLEGVPTGV TLTLLSGNLY AEGFKIADGM NIDNLPKYVM 
Consensus  -LF-R--SWW SFNPE-N--- -I---G--Y- -P-------- T-T---G-LY --G-----G- ----LP-YV- 
211 272 
MHV VAKVSHLCTY K...RAFLDK VDGVSGFAVY VKSK....VG NYRLPSNKPS G..ADTALLR I. 228 
BCV VAKVSHLLTY KR.GF..LDK IGDTSGFAVY VKSKV....G NYRLPSTQKG SGMDTALLRN NI 230 
HCV VAVPSTTIIY SRVGR..SVN SQNCTGWVFY VRVKH....G DFSAVSSPMS NMTENERLLH FF 225 
IBV VCTPDRRNIY RMVQKYTGDQ SGNKKRFATF VYAKQSVDTG ELESVATGGS SLYT...... $ 225 
FIPV IATPSRTIVY TLVGK..QLK ATTATGWAYY VKSKA....G DYST.EARTD NLSEHEKLLH MV 262 
TGEV VALPSRTIVY TLVGK..KLK ASSATGWAYY VKSKA....G DYST.EARTD NLSEQEKLLH MV 262 
Consensus VA--S----Y ---------- ----- G-A-Y V--K----- G -Y-------- ------- L-- -- 


FIGURE 2. Sequence alignment of the M proteins of MHV-A59 (Armstrong et al., 1984], BCV (Lapps 
et al., 1987), HCV-229E (Raabe and Siddell, 1989), IBV (Baudette strain; Boursnell et al., 1984), FIPV 
(Vennema et al., 1991a), and TGEV (Laude et al., 1987). Not included are the recently determined 
sequences of TCV (Verbeek and Tijssen, 1991), HCV-OC43 (Mounir and Talbot, 1992), CCV 
(Horsburgh et al, 1992), and FECV (A. Herrewegh, H. Vennema, R. de Groot, and P. Rottier, 
unpublished data). Comparison shows the former two to be very similar to the sequences of MHV 
and BCV, while the latter have a high similarity to the sequences of FIPV and TGEV. 


classify HCV-229E in a distinct taxonomic cluster; this suggestion was based on 
an extensive antigenic comparison of coronaviruses using monoclonal anti- 
bodies (see Chapter 1). 

As indicated by its solubility, M is a very hydrophobic protein. It contains 
44-51% of hydrophobic amino acids which are concentrated in the NH,- 
terminal half of the molecule (Fig. 3). Despite the high degree of sequence 
variation between the coronavirus M polypeptides, the hydropathicity profiles 
are remarkably similar. The dominant common feature is the occurrence of 
three hydrophobic domains alternating with short hydrophilic regions. In the 
TGEV and FIPV sequence a fourth hydrophobic domain is present at the NH,- 
terminus, which functions as a cleavable membrane insertion signal (Kapke et 
al., 1988; Vennema et al., 1991b). In the other M proteins, no such signal appears 
to be operative; rather, these proteins have a hydrophilic amino terminus. The 
carboxy terminal half of all M proteins is amphiphilic, with a hydrophilic 
domain at the carboxy end. 

In view of the large differences in primary sequences, the surprising conser- 
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FIGURE 3. Hydropathicity profiles of 
M proteins from MHV-A59, BCV, HCV- 
229E, IBV, FIPV, and TGEV determined 
according to Kyte and Doolittle (1982] 
TGEV using a seven residue moving window. 
Peaks extending upward indicate hy- 
drophobic regions, downward peaks 
correspond to hydrophilic regions. 





vation of the overall chemical features indicates that there are rigid structural 
constraints on M as a result of functional requirements. The highest levels of 
sequence conservation appear in the hydrophobic regions and in the center of 
the polypeptide directly adjacent to the third internal hydrophobic region, 
where a stretch of 8 amino acids is extremely well conserved (see consensus in 
Fig. 2). This suggests a selective pressure on these domains for the maintenance 
of important structural characteristics. 

As mentioned above, coronavirus M proteins are glycosylated in their NH,- 
terminal ectodomain, i.e., the hydrophilic domain preceding the first internal 
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hydrophobic region (see Section II.B). In HCV-229E, IBV, FIPV, and TGEV, the 
protein is N-glycosylated. Potential oligosaccharide attachment sites occur 
once in the relevant part of the HCV-229E M sequence (position 45 in Fig. 2), 
twice in the case of IBV M (positions 35 and 38), and once in the cases of FIPV 
and TGEV M (position 33). These sites are indeed used as the numbers of side 
chains are in agreement with those experimentally observed (Stern and Sefton, 
1982b; Kapke et al., 1988; Vennema et al., 1991b). Another N-glycosylation 
consensus sequence (positions 58—60 in Fig. 2) occurs at the start of the first 
internal hydrophobic region of some M proteins but appears not to be used. 
Apparently, this sequence is not exposed to the modifying enzymes in the 
endoplasmic reticulum and resides within the membrane. 

It is not known which of the serine and threonine residues are substituted 
in the O-glycosylated coronavirus M proteins. The structural features that 
determine a functional O-glycosylation site have not yet been established. In 
MHYV, a cluster of four hydroxy amino acids is located next to the initiating 
methionine which presumably is removed posttranslationally as has been 
shown for IBV M (Cavanagh et al., 1986b). The resulting NH,-terminal tetrapep- 
tide sequence (Ser-Ser-Thr-Thr) is identical to the O-glycosylated amino- 
terminus of glycophorin A, the major glycoprotein of the human erythrocyte 
membrane. It was shown by Niemann et al. (1984) that the O-linked sugar 
structures of MHV M are identical to those found in glycophorin. As the 
coronavirus protein also appeared to exhibit blood group M activity, as does 
glycophorin, the authors inferred that the hydroxy amino acid cluster contains 
the functional oligosaccharide acceptor sites in MHV M. On the basis of the 
heterogeneity in the glycosylation of the M protein, they concluded that up to 
three of the four residues in the cluster are modified by oligosaccharide side 
chains. In BCV a slightly different cluster of hydroxy amino acids occurs at the 
NH,-terminus due to the presence of a valine residue (Fig. 2). Assuming that the 
sequence requirements for glycosylation allow for this difference, Lapps et al. 
(1987) suggested, by analogy, that the additions of the glycans take place in this 
terminal segment. They argued that, in the case of BCV, up to two O-linked 
oligosaccharide side chains per M molecule are attached. 

Recent sequence information shows that considerable genetic diversity 
exists among different strains or isolates of coronaviruses in their spike protein 
(e.g., for MHV, see Luytjes et al., 1987; Schmidt et al., 1987; Parker et al., 1989; 
Gallagher et al., 1990; Wang et al., 1992). The limited data available for the M 
gene suggest that variations are less extensive. Comparison, for instance, of the 
A59 and JHM strains of MHV (Pfleiderer et al., 1986) revealed 21 nucleotide 
changes scattered over the entire molecule, which result in only seven conser- 
vative amino acid changes (3.5%). Cavanagh and Davis [1988] analyzed the 
NH,-terminal domain, including the first hydrophobic region, of the M se- 
quence of 23 strains of IBV. Both base substitutions and small deletions and 
insertions were detected. A fourfold greater extent of amino acid variation was 
found in the ectodomain of the protein as compared to the membrane-embedded 
segment. Notably, one of the N-glycosylation sequences (positions 38-40 in 
Fig. 2) was highly conserved while the other (positions 35-37) was not. Also, 
based on a complete comparison of M sequences of two IBV strains (Binns et al., 
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1986), it was concluded that the exposed NH,-terminal domain is the most 
variable part of the molecule. The M proteins of TGEV and PCRV (Rasschaert et 
al., 1990|, two closely related porcine coronaviruses, also differ in their amino 
acid sequences at 13 positions, 8 of which occur in the amino terminal part 
preceding the first hydrophobic region. 


B. Membrane Topology 


The disposition of the M molecule in the lipid bilayer was studied through 
protease protection analyses. Digestion of in vitro assembled MHV-A59 M 
protein showed the bulk of the polypeptide to be resistant to proteolysis: only a 
small (1.5-kDa) portion was removed when the membranes were intact while 
another 2.5-kDa fragment was digested after detergent permeabilization (Rot- 
tier et al., 1984). The latter fragment is located luminally and represents the 
NH,-terminus of the molecule, as was shown by selective labeling. In virions, 
this 2.5-kDa fragment is exposed on the outside and is glycosylated to a variable 
extent, while the 1.5kDa COOH-terminal end protrudes from the inner face of 
the membrane. Experiments with IBV (Cavanagh et al., 1986a) suggest that the 
tertiary structure of the M protein of this virus, and probably of other corona- 
viruses as well, is very similar. 

The results of the biochemical studies, combined with a theoretical anal- 
ysis of the primary structure of the M polypeptide, have led to a general topo- 
logical model of the assembled protein as shown in Fig. 4A (Armstrong et al., 
1984; Rottier et al., 1986). The structure is characterized by the presence of three 
membrane-spanning helices in the NH,-terminal half that anchor the protein 
in the lipid bilayer. This segment is flanked on the one side by the hydrophilic 
NH,-terminus and on the other side by a region that contains the extremely 
well-conserved 8 amino acids domain (Fig. 2) and in which a surface helix is 
predicted for some (but not all) M proteins [Rottier et al., 1986). The bulk of the 
carboxy-terminal half of the M molecule is supposedly embedded in the polar 
surface of the membrane. In line with this, a mutant M protein lacking all the 
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FIGURE 4. Topological models of the membrane-assembled M protein. Note that the two models 
differ only in the disposition of the protease-resistant region in the carboxy-terminal half. The 
hexagon symbol attached to the NH,-terminal region indicates the potential glycosylation sites. 
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membrane-spanning domains was found to associate with membranes in vitro 
(Mayer et al., 1988]. However, a location of this extremely protease resistant 
region adjacent to the membrane, as in Fig. 4B, cannot be excluded. 


IV. ASSEMBLY IN THE ENDOPLASMIC RETICULUM 


Viral proteins use mechanisms and intracellular pathways as do normal 
cellular proteins. The process by which a nascent polypeptide is directed to and 
assembled in the endoplasmic reticulum occurs quite rapidly. Therefore, these 
processes are most conveniently studied in in vitro systems, which are easier to 
manipulate and in which the reaction kinetics are inherently slower. Though 
these systems have given us insight into the processes involved in the insertion 
and translocation of simple membrane and secretory proteins, little is known 
about the events that generate the more complex, multispanning membrane 
proteins. The coronavirus M protein, relatively simple as it is with only three 
transmembrane domains, thus represents an attractive model. 

The MHV-A59 M protein is translated on membrane-associated polysomes 
(Niemann et al., 1982). Its membrane insertion involves the action of the 
ribonucleoprotein complex called the signal-recognition particle (SRP). In a 
wheat germ translation system devoid of membranes, the synthesis of the 
protein could be specifically and stably blocked by the addition of SRP. Subse- 
quent addition of salt-washed microsomes fully released the translation arrest 
and resulted in the correct membrane integration of the polypeptide (Rottier et 
al., 1985). 

In agreement with the absence of a hydrophobic NH,-terminal peptide, the 
protein is assembled without a cleavable signal sequence (Rottier et al., 1984). 
The polypeptide chain is inserted into the lipid bilayer in a cotranslational 
manner. Time course experiments in a synchronized in vitro translation of the 
M mRNA showed that the nascent polypeptide chain was able to integrate 
when membranes were added before 140—150 of the 228 residues had been 
polymerized. Later additions no longer allowed membrane insertion. Once 
beyond a critical point in synthesis, the domain containing the insertion sig- 
nal(s) is apparently no longer accessible to the insertion machinery. Additional 
time course experiments revealed that such signal(s)] may be located anywhere 
within the hydrophobic NH,-terminal half of the molecule. During a syn- 
chronized translation in a wheat germ extract, SRP was able to induce an arrest 
until the most C-terminal hydrophobic domain had emerged from the ribosome 
(Rottier et al., 1985). 

Direct evidence that the topogenic signals in the M protein reside within 
the hydrophobic part of the polypeptide came from expression studies with 
cloned cDNA copies of M genes. In vitro mutagenesis followed by transcription 
and translation in the presence of microsomal membranes showed that each 
hydrophobic domain can individually insert and anchor the polypeptide in the 
membrane (Mayer et al., 1988; Rottier et al., 1990; Krijnse Locker et al., 19925). 
Similar results were obtained by expression of mutant M genes of MHV-A59 M 
(Mayer et al., 1988; Armstrong et al., 1990; Krijnse Locker et al., 1992b| and IBV 
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M (Machamer and Rose, 1987). The mutant proteins with only the first or the 
third transmembrane domain integrate in membranes such that their NH,- 
terminus is translocated into the lumen, while their COOH-terminus remains 
on the cytoplasmic side. Large deletions in the hydrophilic NH,-terminal re- 
gion did not affect this ability, nor did the mutations alter the topology of the 
assembled protein (Mayer et al., 1988; Krijnse Locker et al., 1992b). This part of 
the protein consequently plays no role in the membrane integration process. As 
expected, similar conclusions could be drawn from studies of mutations in the 
carboxy-terminal domain (Armstrong et al., 1990; Rottier et al., 1990; Krijnse 
Locker et al., 19925]. 

These observations lead to the model shown in Fig. 5. SRP interacts with 
the first hydrophobic domain as soon as it appears from the ribosome; elonga- 
tion halts until the complex has attached to the membrane where the hydro- 
phobic domain interacts with the signal sequence receptor and is inserted into 
the membrane, probably as a hairpin, while SRP is released. Presumably, the 
hydrophilic NH,-terminus is translocated and the two following hydrophobic 
domains are then sequentially inserted. Completion of the polypeptide chain 
accompanied by further cotranslational folding finally leads to the fully assem- 
bled protein. 

Interestingly, the deduced amino acid sequences of the TGEV (Laude et al., 
1987; Kapke et al 1988], FIPV (Vennema et al., 1991a], and canine coronavirus 
(CCV) M protein (Horsburgh et al., 1992) have a hydrophobic amino terminal 
extension (see Section III.A). Sequencing of the mature M polypeptide from 
purified TGEV showed that the first 17 residues were absent, indicating a 
cleavable signal sequence (Laude et al 1987). Translation studies with mRNA 
specifying TGEV M showed the signal-directed membrane insertion of the 
protein in vitro, although cleavage did not occur under these conditions (Kapke 
et al., 1988). These authors also tested the expression of a truncated version of 
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FIGURE 5. Model of the membrane assembly process of the coronavirus M protein. 
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the gene which lacked the information for the signal-containing first 21 resi- 
dues. Though the polypeptide appeared to be integrated efficiently into micro- 
somal membranes, translocation of the NH,-terminal domain was poor, as 
judged from the small extent of N-glycosylation of the protein. In contrast, 
deletion of the signal sequence had no effect when the FIPV M protein was 
expressed in a vaccinia virus/cell system (Vennema et al., 1991b). The mutant 
protein was glycosylated as efficiently as the wild-type protein. Why, in these 
viruses, the M protein contains a cleavable amino-terminal signal sequence 
thus remains unclear. 


V. INTRACELLULAR TRANSPORT AND MATURATION 
A. Transport and Processing in Coronavirus-Infected Cells 


Intracellular transport of the coronavirus M protein differs from that of 
most other viral glycoproteins, including the coronaviral spike protein. Where- 
as these proteins are usually targeted to the cell surface, migration of the M 
protein is limited to the perinuclear region of the cell. This restricted mobility 
correlates with the intracellular location of coronavirus assembly in the in- 
fected cell. 


1. Intracellular Budding of Coronaviruses 


Already from the early studies of the coronavirus infection process, it 
became clear that the entire replication cycle takes place in the cytoplasm. 
Ultrastructural observations with different coronaviruses demonstrated that 
morphogenesis occurs at intracellular membrane-bound compartments as vir- 
ions were seen in the lumina of the rough endoplasmic reticulum, smooth- 
walled vesicles and Golgi apparatus (e.g., David-Ferreira and Manaker, 1965, 
Becker et al., 1967; Chasey and Alexander, 1976; Holmes and Behnke, 1981; 
Dubois-Dalcq et al., 1982, 1984}. Virus budding never takes place at the plasma 
membrane. A thorough investigation by J. and S. A. Tooze et al. (J. Tooze et al., 
1984, 1985, 1987; S. A. Tooze et al., 1988], using a combination of biochemical 
and morphological techniques, has elucidated the temporal sequence of events 
in the maturation of MHV-A59 in Sac(—) cells, a line of transformed murine 
fibroblasts. Early in infection, shortly after the appearance of M and S, the first 
progeny virions are seen by electron microscopy in the perinuclear region. 
Budding of these early particles occurs into small, smooth vesicles or tubules 
lying between the rough endoplasmic reticulum and the cis side of the Golgi 
stack. This smooth membrane compartment, termed budding compartment by 
the authors (S. A. Tooze et al., 1988], is distinct from and does not form part of 
the Golgi complex but is connected to the rough endoplasmic reticulum 
(Krijnse Locker et al., 1994a). At early times, budding does not occur in either of 
these two organelles. Later, however, budding also starts to occur into the rough 
membranes of the endoplasmic reticulum, and this compartment becomes the 
major site of virus assembly late in infection. Budding into the Golgi cisternae 
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is a rare event. Virions are transported from their site of synthesis to and 
through the Golgi complex by vesicular carriers. In the Golgi complex they are 
usually seen only at the rims of the stacks. At the trans side of the Golgi system, 
particles are collected into vesicles of the constitutive exocytic pathway |]J. 
Tooze et al., 1987) and released from the cell. 

The budding compartment also exists in uninfected Sac(—) cells, where it is 
associated with transitional elements and vesicles of the rough endoplasmic 
reticulum and the cis side of the Golgi complex. Presumably, it is equivalent 
to the intermediate or salvage compartment which plays a crucial role in the 
sorting of resident proteins of the endoplasmic reticulum (for review, see Hauri 
and Schweizer, 1992). As virions accumulate in the budding compartment 
during MHV-A59 infection, it becomes strongly dilated. 

Budding into the intermediate compartment seems to be a general feature 
of coronaviruses. An ultrastructural analysis of cells infected with IBV, TGEV, 
or FIPV revealed that these viruses also use the smooth perinuclear membranes 
(Klumperman et al., 1994). Budding into such tubulovesicular membrane struc- 
tures has also been described in neural cells infected with either MHV-A59 or 
the JHM strain of MHV (Dubois-Dalcq et al., 1982). 


2. Localization and Transport of the M Protein 


After its synthesis on membrane-bound polyribosomes, the M protein is 
transported from the endoplasmic reticulum and through the Golgi complex to 
finally appear outside the cell as part of virions. The rate and extent of this 
process differs depending on the particular virus-cell system and on the time of 
infection. Pulse-chase experiments early in MHV-A59 infection showed the M 
protein to be chased almost quantitatively from 17C11 cells within about 2 hr 
(Holmes et al 1981a). In contrast, very inefficient clearance of M was observed, 
both from 17C11 cells (Holmes et al., 1981b) and from Sac(- ) cells (Rottier et al., 
1981b) during a similar period somewhat later in the infection cycle. It appears 
that in early stages the synthesis of the protein is balanced with its release in 
virions. 

To relate the localization of the MHV-A59 M protein to the site of budding, 
J. and S. A. Tooze et al. (J. Tooze et al., 1984; S. A. Tooze et al., 1988) labeled 
infected Sac(—) cells for indirect immunofluorescence with a monoclonal anti- 
body to the protein. Early in infection, a perinuclear fluorescence pattern was 
observed that was similar to but more extensive than that obtained with anti- 
bodies specific for the Golgi apparatus. As infection proceeded, a more reticular 
staining throughout the cell appeared. No labeling of the plasma membrane 
occurred until late in infection when patches of fluorescence indicated associa- 
tion of released virions with the cell surface. Immunoperoxidase staining con- 
firmed that the M protein had accumulated in the budding compartment and 
not in the endoplasmic reticulum early in infection. There was, however, also 
some labeling of the stacked cisternal membranes of the Golgi complex, indi- 
cating that some M protein reaches the Golgi apparatus as free integral mem- 
brane protein. As no virus assembly is observed in these compartments, it was 
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inferred that the density of the protein in this organelle apparently does not 
reach the critical threshold level required for budding. 

Similar data were obtained by Klumperman et al. (1994) in a study of the 
subcellular localization of M proteins in coronavirus-infected cells. Using im- 
munogold labeling and electron microscopy, they demonstrated that in MHV 
and IBV infection, both in Sac(—) cells, free M protein is present in the mem- 
branes of the budding compartment as well as in the Golgi complex. Within the 
Golgi complex, the distribution patterns of MHV-M and IBV-M were found to be 
different. MHV-M was localized more toward the trans side, while IBV-M was 
concentrated on the cis side, similar to the patterns found when these proteins 
are expressed independently (Machamer et al., 1990; Krijnse Locker et al., 
1992a; Klumperman et al 1994). 

The combined data indicate that at early times the M protein is transported 
as an integral membrane protein to the smooth membranes of the budding 
compartment where it accumulates and is incorporated into virions and from 
where it is exported through the Golgi complex out of the cell. At later times, 
when the rate of M synthesis in the rough endoplasmic reticulum exceeds the 
rate of its exit to the budding compartment, the protein allows virion assembly 
also in these reticular membranes. A fraction of M apparently escapes from 
being incorporated into virions and accumulates in the Golgi complex. 

Interestingly, when the temperature of Sac(- cells infected with MHV- 
A59 was lowered to 31 °C, release of virions was strongly inhibited (S. A. Tooze 
et al., 1988). Electron microscopic analysis showed that virion formation at this 
temperature occurred normally, but that the particles accumulated in the bud- 
ding compartment and in the rough endoplasmic reticulum. Entry of virions 
into the Golgi complex appears to be inhibited at 31?C, as few virions were 
observed in or beyond this compartment and the M protein did not undergo the 
oligosaccharide modifications known to occur there. 

MHV M protein is not transported as free integral membrane protein to the 
plasma membrane of infected cells (Dubois-Dalcq et al., 1982; J. Tooze et al., 
1984, 1987; S. A. Tooze et al., 1988; S. A. Tooze and Stanley, 1986). The protein is 
detected there only in extracellular virions that have readsorbed in clusters to 
the cell surface (Sugiyama and Amano, 1981; Dubois-Dalcq et al., 1982; S. A. 
Tooze et al., 1988). This is in keeping with the Golgi localization of MHV-M 
when expressed from its cloned gene (see Section VB). It was surprising, there- 
fore, that free M protein was observed at the surface of cells infected with 
TGEV. This was shown both biochemically (surface iodination) and serologi- 
cally using M-specific monoclonal antibodies by Laviada et al. (1990). The 
protein was detected at the plasma membrane as early as 4 hr after infection, 
i.e., before any infectious virus had been released from the cells. In another 
study, however, Pulford and Britton (1990) could not confirm these findings. 


3. Oligosaccharide Modifications during Transport 


Transport of glycoproteins through the biosynthetic pathway of the cell is 
accompanied by modifications of their glycan moieties as they encounter the 


128 PETER J. M. ROTTIER 


modifying enzymes. Due to the specific locations of these enzymes, the mod- 
ifications occur sequentially in time and place. As a consequence, the state of 
glycosylation is an indicator of a glycoprotein's progress on the exocytic route. 
Many of the oligosaccharide-processing enzymes occur in the Golgi apparatus. 
The formation of coronavirions in a pre-Golgi compartment thus implies that 
these enzymes act on glycoprotein molecules which present themselves as 
parts of huge macromolecular structures. 

The M protein of a number of coronaviruses is glycosylated only by 
O-linked carbohydrates (see Table I). This type of linkage is rare among viral 
glycoproteins, but is found quite frequently in various cellular glycoproteins. 
Usually, O-linked oligosaccharides occur on a polypeptide in combination with 
N-glycosidically linked side chains. Sometimes the O-glycosylation pattern is 
very extensive and complex such as in mucin-type molecules. For these and 
other reasons, O-glycosylation has been difficult to study, and our knowledge 
lags far behind that of N-glycosylation. Because of its relative simplicity, the 
coronavirus M protein seems an attractive tool to catch up. 

The M protein of MHV-A59 has been resolved into a number of differen- 
tially O-glysosylated forms. As mentioned previously, two types of oligosac- 
charide side chains are bound to M in the virions produced by 17C11 cells (see 
Fig. 1) (Niemann et al., 1984). The same structures were found to predominate 
when the virus was grown in Sac(—) cells (S. A. Tooze et al., 1988], but at least 
two additional glycosylated species were detected in these cells. One was 
identified as carrying only N-acetyl-galactosamine (GalNAc); the other was not 
identified. Two glycosylated forms were observed in AtT20 cells, a pituitary 
tumor cell line, but their structures were not clearly resolved (J. Tooze et al., 
1987). In all cases, a significant proportion of the M molecules remained un- 
glycosylated. These observations indicate that the extent and possibly also the 
nature of O-glycosylation varies and is determined by the host cell. 

Early studies on the biosynthesis of MHV-A59 M, both in infected Sac(- ) 
cells (Rottier et al 1981b) and in 17C11 cells (Holmes et al., 1981b; Niemann et 
al., 1982), suggested that the addition of O-linked oligosaccharides is a post- 
translational event. This idea is now well established (S. A. Tooze et al., 1988; 
Krijnse Locker et al., 1992a, 1994a). The M protein is synthesized in a non- 
glycosylated form in the rough endoplasmic reticulum and is transported to the 
intermediate compartment where the first sugar, GalNAc, is added. Due to the 
membrane-continuities between these compartments, no vesicular transport 
step is required for this addition (Krijnse Locker et al., 1994a]. All subsequent 
modifications occur in the Golgi apparatus. First, a galactose unit is added, 
followed immediately by sialic acid. The transferase enzymes responsible for 
these additions are both located in Golgi cisternae, proximal to the trans-Golgi 
network. Finally, further modifications can lead to the appearance of two more 
forms of the M protein, the oligosaccharides of which have not been charac- 
terized. These modifications occur in the trans-Golgi network as they can be 
inhibited specifically by treatment of the cells with the drug brefeldin A 
(Krijnse Locker et al., 19923]. 

Coronavirus M proteins that are modified by N-linked oligosaccharides 
acquire their sugars cotranslationally in the endoplasmic reticulum. Core- 
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glycosylation is, however, not a very efficient process with these proteins. 
Unglycosylated and partially glycosylated molecules occur in infected cells as 
well as in the virions released from them (Stern and Sefton, 1982a, b; Stern et al., 
1982; Cavanagh, 1981, 1983; Vennema et al., 1990b; Jacobs et al 1986). Incom- 
plete glycosylation does not correlate with the involvement of a cleavable NH,- 
terminal signal sequence during biosynthesis; it was observed both with IBV 
and with FIPV and TGEV M proteins which have an internal and an amino 
terminal insertion signal, respectively. During transport to the budding com- 
partment, and on passing through the Golgi complex en route to the plasma 
membrane, the high-mannose oligosaccharides are trimmed and processed, 
having been converted to the complex type as they are released from the cell. 
Again, these processes occur very inefficiently, since a significant proportion of 
the M molecules in extracellular virions remains sensitive to the enzyme 
endoglycosydase H which only recognizes immature glycans (Stern and Sefton, 
1982b; Cavanagh, 1983; Vennema et al., 1990b). The incomplete maturation of 
coronavirus glycoproteins is not specific for the M protein. The same applies to 
the S protein (Stern and Sefton, 1982b; Cavanagh, 1983; Vennema et al.,1990b). 
The reasons for these findings are unclear, but it is plausible that they are 
caused by steric effects. Since the proteins move through the Golgi compart- 
ments as part of and protruding from the viral envelope, access of the modifying 
enzymes to the oligosaccharides might well be severely hindered. Incomplete 
maturation of N-linked carbohydrates is not unprecedented and has occa- 
sionally been observed with other glycoproteins (Doyle et al 1986; Geyer et al., 
1988; Earl et al., 1991). 


B. Transport of the Expressed M Protein 


The interesting membrane structure and the intracellular restriction in 
coronavirus-infected cells prompted studies of the biogenesis and transport of 
the M protein in the absence of other coronaviral proteins. The M proteins of 
MHV-A59, IBV, TGEV, and FIPV have been expressed in cells from cloned 
cDNA (e.g., Machamer and Rose, 1987; Rottier and Rose, 1987; Machamer et al., 
1990; Klumperman et al., 1994) as well as by microinjection of an in vitro 
transcribed mRNA (Armstrong et al., 1987, 1990; Mayer et al., 1988). These 
studies unequivocally demonstrate that the protein accumulates in the Golgi 
apparatus and does not reach the plasma membrane. Analysis of the expressed 
MHV-A59 M protein by immunofluorescence in various cell types revealed the 
perinuclear appearance typical for the Golgi complex (Armstrong et al 1987; 
Rottier and Rose, 1987; Mayer et al., 1988; Krijnse Locker et al., 19923; Klumper- 
man et al., 1994 [see also Fig. 6). This localization was confirmed by electron 
microscopy using immunogold labeling and Golgi-specific markers (Krijnse 
Locker et al., 1992a; Klumperman et al., 1994), showing that within the Golgi 
complex the MHV M protein is concentrated in the trans-most compartments. 
Consistently, biochemical labeling revealed that, although a proportion of the 
molecules usually remained unglycosylated, as in coronavirus-infected cells, 
the large majority acquired the O-linked oligosaccharides added in the Golgi 
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FIGURE 6. Localization of the MHV-A59 M and S protein by indirect immunofluorescence. Sac(-) 
cells expressing the M protein from (A) a vaccinia virus vector or, (B, C) during infection with MHV- 
A59 were stained with a M-specific antipeptide serum (A, B) or an S-specific monoclonal antibody 
after permeabilization. 


apparatus (Armstrong et al., 1987; Rottier and Rose, 1987; Krijnse Locker et al., 
19923; Klumperman et al., 1994). In contrast, the IBV M protein expressed in 
AtT20 cells (Machamer et al., 1990) or in HepG2 cells (Klumperman et al., 1994) 
was localized by immunoelectron microscopy to the cis side of the Golgi 
complex. Accordingly, its N-linked sugars remain largely immature. Taken 
together, these studies indicate that the intracellular restriction of coronavirus 
M is an intrinsic property of the protein and is not dependent on other corona- 
viral factors. 

Its accumulation in the Golgi apparatus, midway on the exocytic pathway, 
has made the M protein an attractive tool for studying the principles of target- 
ing to and retention in this organelle. For the IBV protein, Machamer and co- 
workers (Machamer and Rose, 1987; Machamer et al., 1990, 1993; Swift and 
Machamer, 1991) have shown that the first transmembrane domain carries the 
signal necessary and sufficient for localization in the cis-Golgi region. This was 
demonstrated most convincingly by transferring the hydrophobic domain into 
reporter molecules normally transported to the plasma membrane and showing 
their Golgi retention (Swift and Machamer, 1991). Several uncharged polar 
residues were found to be critical for the functioning of the domain. These 
residues line one face of a predicted a-helix formed by the transmembrane 
domain (Swift and Machamer, 1991; Machamer et al., 1993). Surprisingly, no 
such role is played by the first transmembrane domain of the MHV M protein. A 
mutant M protein with only this transmembrane domain did not leave the 
endoplasmic reticulum (Armstrong et al., 1990; Rottier et al., 1990), although 
inserting this domain in place of the membrane-spanning domain of a plasma 
membrane protein still allowed this chimeric molecule to reach the cell surface 
(Machamer et al., 1993). Mutation studies with the MHV M protein indicate 
that, in this case, the carboxy-terminal domain, probably in combination with 
an internal domain, determines the Golgi localization (Armstrong and Patel, 
1991; Krijnse Locker et al. 1994b]. It was suggested that different principles may 
act in the differential localization of the IBV and MHV M proteins in the Golgi 
complex (Weisz et al., 1993). 
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Glycoproteins of enveloped viruses are involved in a number of biological 
activities. Envelope proteins function in the binding of virions to cell receptors. 
They mediate the introduction of the nucleocapsid into the cytoplasm by 
triggering the fusion of viral and cellular membranes. Viral membrane proteins 
are also essential at the end of the infection cycle during the process of virion 
assembly. In some viruses, envelope glycoproteins exhibit receptor-destroying 
activities. Finally, the proteins generally induce immunological responses in 
the host by eliciting neutralizing antibodies and cytotoxicity. In coronaviruses, 
several of these properties have been attributed to the S and HE proteins. The M 
protein has been demonstrated to play a key role in coronavirus budding and to 
induce immunological reactions during infection in the host. 


A. Role of M in Coronavirus Budding 


There are two types of findings pointing to a predominant function of the 
M protein in the intracellular formation of progeny virus particles. First, growth 
of coronaviruses in the presence of tunicamycin gave rise to the production of 
spikeless, noninfectious virions (Holmes et al., 1981b; Rottier et al., 1981b; 
Stern and Sefton, 1982b; Mounir and Talbot, 1992). These particles were indeed 
devoid of S and HE protein but contained M, suggesting that M is the only 
envelope glycoprotein required for virus budding. Second, throughout the infec- 
tion there is a correlation between the intracellular sites at which progeny 
virions bud and the perinuclear location of the M protein (see Section V). 
Concentration and location of M apparently are decisive in determining where 
and when budding occurs. 

Molecular details of the budding process have not yet been elucidated. 
Conceivably, the M protein is transported as an integral membrane protein to 
the budding compartment or, later in infection, backs up in the endoplasmic 
reticulum. The protein accumulates locally to a concentration high enough to 
be recognized by nucleocapsids formed in the cytosol. The nucleocapsids asso- 
ciate with the M protein by interacting with its cytoplasmic domain. These 
interactions initiate budding into the lumen of the membrane compartment, 
whereby M is sequestered into progeny virions. An affinity of M for nucleocap- 
sids has been observed in vitro with several coronaviruses. Subviral particles 
prepared by Nonidet P-40 disruption of purified MHV (Wege et al., 1979; Stur- 
man et al., 1980), HEV (Callebaut and Pensaert, 1980), IBV (Lanser and Howard, 
1980], or BCV (King and Brian, 1982] still contained M protein associated with 
the nucleocapsids. The significance of these interactions remains to be as- 
sessed. Binding of MHV-M to nucleocapsids did occur through an interaction 
with the RNA but was not specific for viral RNA and was dependent on a 
temperature-induced conformational change in the M protein (Sturman et al., 
1980). The distribution of positive charges over the M molecule certainly seems 
to favor an association with the viral RNA. For instance, all but one of the 18 
arginines and lysines present in the MHV M protein are located at the cyto- 
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plasmic face of the intracellular membrane; of these, 5 occur in the carboxy- 
terminal 24 amino acids. 

Little is still known about where and how the other viral membrane 
proteins are assembled into coronavirions. Electron microscopic observations 
have shown that the envelopes of budding virions were covered with peplomers 
(Chasey and Alexander, 1976; Dubois-Dalcq et al 1982). Apparently, budding 
involves the collective incorporation of all viral envelope proteins. This implies 
that the spike glycoproteins convene with the M proteins at the sites of budding 
and, by consequence, that the different viral envelope proteins recognize each 
other and interact. Association of the M and S protein in MHV-A59 infected 
cells has indeed been demonstrated (Opstelten et al 1993a, 1994). After their 
synthesis, M molecules rapidly engage in formation of noncovalently linked 
heteromultimeric structures with S proteins, probably already in the endo- 
plasmic reticulum. In contrast, newly synthesized S protein first has to undergo 
a number of folding reactions before it reaches a conformation competent to 
interact with M. If proper folding of S is prevented, e.g., by inhibiting the 
formation of disulfide bonds, no complexes between the two proteins are 
formed and the M protein is transported to the Golgi complex (Opstelten et al., 
1993b]. 

These data suggest a model of virus assembly in which the viral envelope 
proteins form complexes that accumulate in the budding compartment to 
generate a microenvironment where nucleocapsids can bind and assemble into 
virions. Such a process would explain the exclusion of host cellular membrane 
proteins from virions as the molecular selection for viral membrane proteins 
would preclude their incorporation into the heteromultimeric complexes. It 
might also explain why the M and S proteins are efficiently assembled into viral 
particles in the budding compartment, while their intrinsically preferred desti- 
nations are the Golgi complex and the plasma membrane, respectively. Consis- 
tent with such a model, the MHV M and S proteins when coexpressed in cells 
were found to associate and form complexes (D.-J. Opstelten and P. Rottier, 
unpublished observations). Surprisingly, however, these complexes were not 
retained in the budding compartment, but accumulated in the Golgi complex. 
Apparently, other viral factors are required to localize budding in pre-Golgi 
membranes. One possibility is that the nucleocapsid plays an organizing role. 
Alternatively, the small nonglycosylated virion membrane protein (SM) might 
be important. So far, this protein has largely been neglected. Further studies are 
warranted, however, because the protein was recently shown to play a key role 
in the formation of viruslike particles (H. Vennema, G.-J. Godeke, and P. Rot- 
tier, unpublished observations). 

While the M protein is required for budding, its glycosylation is not. Non- 
glycosylated coronavirus M proteins are able to efficiently direct the formation 
of virions as was shown using tunicamycin (Stern and Sefton, 1982b} and 
monensin (Niemann et al., 1982), inhibitors of N- and O-glycosylation, respec- 
tively. Moreover, in the absence of drugs no particular form of the differentially 
glycosylated M species is preferentially incorporated into virions. Their relative 
abundance in extracellular viral particles correlates with the ratio of their 
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appearance in the infected cell (Stern and Sefton, 1982b; Holmes et al., 198143). 
Additional evidence that glycosylation of M is not essential for virus assembly 
comes from the demonstration by Laude et al. (1992) that disruption of the 
consensus sequence of the sole N-glycosylation site in the M protein of TGEV 
results in a viable mutant virus. The authors mention that the mutation did not 
affect the specific infectivity of the virus nor its multiplication rate. Appar- 
ently, the oligosaccharides attached to the coronaviral M proteins are not 
important for the infection process at the level of the cell. They probably play a 
role in the interaction with the host at the level of the organism. 


B. Induction of Immunological Responses 


With the exception of FIPV (Vennema et al., 1990a), the S glycoprotein of 
coronaviruses is believed to be the prime inducer of protective immunity. 
Nevertheless, several observations indicate that immune responses to other 
viral proteins, including the M protein, may also play a role. Though largely 
buried within the membrane, the M protein does elicit specific antibodies 
during infection in the host and also when expressed separately through a live 
carrier virus (e.g., Pulford and Britton, 1990; Vennema et al., 1991a; Wesseling et 
al., 1993]. 

Monoclonal antibodies to the M protein can neutralize infectivity in vitro, 
but only in the presence of complement, as was shown for some monoclonal 
antibodies to MHV-M (Collins et al., 1982; Fleming et al., 1989) and TGEV-M 
(Woods et al., 1988). In most cases, however, anti-M antibodies appear non- 
neutralizing in vitro (Buchmeier et al., 1984; Laude et al., 1986; Deregt and 
Babiuk, 1987; Fiscus and Teramoto, 1987; Fleming et al., 1989). 

Little is known about the protective effects of antibodies to the M protein 
in animals. Monovalent antibodies elicited by immunization of mice with 
purified MHV M protein failed to protect against a virus challenge (Hasony and 
Macnaughton, 1981). In contrast, two of four monoclonal antibodies to MHV- 
JHM M did protect mice from a normally lethal challenge after passive transfer 
(Fleming et al., 1989). This protection was not associated with a particular 
antigenic determinant in the M protein nor was it mediated by complement. 
Interestingly, one of the two antibodies was not neutralizing in vitro, not even 
in the presence of complement. Some level of protection against another coro- 
navirus was also observed in cats after immunization with a recombinant 
vaccinia virus expressing the FIPV M protein (Vennema et al., 1991a). Although 
all animals seroconverted after a challenge with a lethal dose of FIPV and 
developed clinical signs, three of eight kittens survived. 

The M protein of TGEV can induce the production of a-interferon in 
lymphocytes as was shown by the inhibiting effect of anti-M monoclonal 
antibodies on interferon induction by fixed TGEV-infected cells (Charley and 
Laude, 1988). By analysis of mutant viruses with reduced interferogenic poten- 
tial, this biological activity was mapped to the exposed amino-terminal region 
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of the M molecule (Laude et al 1992). The significance of a-interferon action 
for immunity to viral infection needs to be established. 

Little is known about the cellular immune response to coronavirus infec- 
tions. A cytotoxic T-cell response to the M protein has not been described. It has 
been suggested, however, that cell-mediated recognition of the M protein might 
be an important part of an effective immune response to MHV-JHM (Mobley et 
al., 1992). 


VII. SUMMARY AND PERSPECTIVES 


The coronavirus M protein has a unique molecular structure. The different 
domains in its structure may explain the specific properties of the protein and 
reflect its biological features in infection. 

The N-terminal hydrophilic virion ectodomain carries the N- or O-linked 
oligosaccharides. It is the most variable part of the molecule which presumably 
contains the major antigenic determinants. It is responsible for immunological 
reactions such as antibody and interferon induction. 

The hydrophobic region containing the three transmembrane helixes con- 
stitutes the core of the protein and is probably responsible for its peculiar 
physical properties. This region dictates the protein's topology in the mem- 
brane and its intracellular transport. It is the part of the protein most likely to be 
engaged in the intermolecular interactions that must occur at the membrane 
during the virion assembly process, both mutually between M molecules and 
with other viral membrane proteins. 

The carboxy-terminal half of the molecule consists of an amphiphilic 
portion and an exposed tail facing the cytoplasm in infected cells. Virion bud- 
ding is accomplished by the association of nucleocapsids with this part of the M 
protein. 

The data reviewed in this chapter demonstrate that the coronavirus M 
protein has a number of interesting features both as a virion protein and as a 
model membrane protein. Though much has already been learned, many funda- 
mental questions remain to be answered. Clearly, more needs to be known 
about the precise mechanism by which the protein is assembled in the mem- 
brane, about the disposition of the amphiphilic region of the molecule, and 
about the signals governing its intracellular transport. Little is known about the 
molecular details of the assembly of coronaviruses, the interactions of the M 
proteins with the nucleocapsid, or the interactions with the other membrane 
components. Nothing is known about the importance of the differential glyco- 
sylation among coronaviral M proteins. Some of these questions may be ap- 
proached through mutagenesis and expression of the M gene. The answer to 
most questions, however, will require the generation of mutant viral genomes, 
either by the manipulation of infectious cDNA clones or by RNA recombina- 
tion. This holds true not only with respect to the role of the M protein but for 
the molecular analysis of coronavirus infection in general. Evidently, this will 
be the major challenge for coronavirologists in the years to come. 
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CHAPTER 7 


The Coronavirus 
Nucleocapsid Protein 


HUBERT LAUDE AND PAUL S. MASTERS 


I. INTRODUCTION 


The nucleocapsid (N) proteins of coronaviruses, as with most enveloped vi- 
ruses, have received less attention than the surface glycoproteins and generally 
have been perceived to be of lesser concern. Interest in this class of proteins, 
however, has been stimulated in recent years by such diverse developments as 
the recognition of the importance of RNA-protein interactions, most notably 
in the study of spliceosomes and related ribonucleoproteins, as well as the 
finding in numerous viral systems that internal virion proteins can be major 
determinants of the cellular immune response. 

Coronaviruses represent, together with the recently characterized toro- 
viruses, the only positive-strand enveloped viruses with helically symmetric 
nucleocapsids. Thus, coronavirus N proteins present a number of intriguing 
molecular biological problems: What are the geometric parameters of these 
nucleocapsid structures and how do they compare to their negative strand RNA 
virus counterparts? What is the nature of the N-RNA and N-N interactions 
that stabilize the helix? Where are the loci of the multiple phosphorylations in 
the N molecule and what regulatory role (if any) do these play? How does N 
interact with the membrane protein (M) during virion assembly and uncoating? 
How does N participate in the unique mechanism of coronavirus RNA syn- 
thesis? Finally, what is the antigenicity of the N molecule? This chapter re- 
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views our present state of knowledge about these largely unanswered ques- 
tions. 


II. CORONAVIRUS RIBONUCLEOPROTEIN 


The structure of the nucleocapsid (or ribonucleoprotein, RNP} of corona- 
viruses has not yet been completely elucidated, although electron microscopic 
studies are in general agreement about the nature of this internal viral compo- 
nent. Ideally, a complete description of the coronaviral RNP would include: the 
diameter of the helical nucleocapsid as well as the diameter of the central 
cylindrical hole around which the helix is wound; the number of helical turns 
per unit length; the stoichiometry of nucleotides of RNA per monomer of N 
protein; the overall length of the encapsidated genome; and, finally, the super- 
structure of the helix packed within the virion. The existing studies, carried out 
by thin sectioning of virion particles (Apostolov et al., 1970) or by negative 
staining of spontaneously or detergent-disrupted virions (Kennedy and Johnson- 
Lussenburg, 1975/76; Macnaughton et al., 1978; Caul et al., 1979; Davies et al., 
1981], provide some of these parameters for some coronaviruses. 

The earlier morphological work described the coronavirus RNP as a thread- 
like strand 7-8 nm in diameter [avian infectious bronchitis virus (IBV): Apos- 
tolov et al., 1970)] or 8-9 nm in diameter [human coronavirus-229E (HCV-229E]: 
Kennedy and Johnson-Lussenburg, 1975/76}]. Subsequent work yielded images 
of clearly helical structures from HCV-229E (Macnaughton et al., 1978; Caul et 
al. 1979], murine hepatitis virus-3 (MHV-3) (Macnaughton et al., 1978), and IBV 
(Davies et al., 1981). These had reported diameters of 9-11 nm, 11-13 nm (Caul et 
al., 1979], or 14-16 nm (Macnaughton et al., 1978), with 3- to 4-nm diameter 
hollow cores. The variations in observed dimensions possibly reflect differ- 
ences in the methods of sample preparation. These helix diameters fall at the 
most narrow extreme for those observed with nucleocapsids in the paramyxo- 
virus family (i.e., in the pneumovirus genus). RNP structures of IBV were the 
most difficult to prepare among the viruses studied (Macnaughton et al., 1978; 
Davies et al., 1981) and fully elongated unwound helices were often observed, 
the longest of these being 6.7 um in length. 

The long, helical coronavirus RNP apparently must assume a higher-order 
structure to be packaged into the viral particle. For HCV-229E, a tightly coiled 
superstructure (approximately 60-nm diameter] was sometimes observed in 
material from disrupted virions, although more often this appeared as a tangled 
or knotted mass (Kennedy and Johnson-Lussenburg, 1975/76). Similarly, a 
coiled or helical superstructure was seen for the RNP of MHV-A59 virions 
budding into the endoplasmic reticulum of infected cells (Holmes and Behnke, 
1981). This coiling suggests that the coronavirus helical nucleocapsid is fairly 
flexible, another structural similarity it shares with paramyxovirus nucleo- 
capsids. 

Several biochemical features of virion-associated N protein, established in 
early studies, were consistent with its postulated role as the structural protein 
of the coronavirus RNP. N was found to be the sole virus polypeptide not to be 
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glycosylated and to be phosphorylated (Pocock and Garwes, 1977; Wege et al., 
1979; Stohlman and Lai, 1979; Callebaut and Pensaert, 1980; Macnaughton, 
1980; Cavanagh, 1981; Lomniczi and Morser, 1981; Stern et al 1981; Rottier et 
al.,1981a; Siddell et al., 1981b; King and Brian, 1982). A relatively high content in 
arginine and glutamine residues was established, consistent with the basic 
character anticipated for a nucleic acid binding protein (Garwes et al., 1976; 
Sturman, 1977). Also, protease digestion of virions left the N polypeptide com- 
pletely unaffected, arguing for its internal location (Sturman, 1977; Callebaut 
and Pensaert, 1980; Obert et al., 1981; King and Brian, 1982; Schmidt and Kenny, 
1982). 

Direct experimental support for the assignment of the N polypeptide as the 
nucleocapsid protein came from the analysis of the composition of subviral 
components from detergent-solubilized virions separated on density gradients. 
In several reports, treatment of the viral envelope with a nonionic detergent in 
low salt concentration failed to liberate true RNP structures. Thus, gel electro- 
phoretic analysis of subviral particles released by NP40 treatment at 20 °C from 
transmissible gastroenteritis virus (TGEV) virions (buoyant density 1.295 g/ml 
in cesium sulfate: Garwes et al., 1976), MHV-JHM virions (1.26 g/ml in sucrose: 
Wege et al., 1979], hemagglutinating encephalitis virus (HEV) virions (Pocock 
and Garwes, 1977), or from IBV virions treated with Triton X-100 at 37 °C (1.32 
g/ml in sucrose: Lancer and Howard, 1980) revealed the presence of two poly- 
peptides, corresponding to the nucleocapsid protein N and the membrane glyco- 
protein M. The recovery of RNP structures containing essentially only the N 
polypeptide species was reported following NP40 treatment at 0-4 °C of MHV- 
A59 virions (Sturman et al., 1980) and of IBV virions (1.27 g/ml in sucrose: 
Davies et al., 1981). The MHV study recognized a subtle influence of tempera- 
ture on the interaction of M protein with the RNP: virions disrupted by NP40 at 
4°C released a structure banding at 1.28 g/ml in sucrose and containing only N 
protein, while subsequent incubation at 37 °C produced an M protein-nucleo- 
capsid complex banding at 1.22 g/ml. 


III. N PROTEIN STRUCTURE 


Almost all of our present structural knowledge of the N proteins of corona- 
viruses derives from amino acid sequences deduced from nucleotide sequences 
of cloned N gene cDNAs. To date, these have been reported for 11 coronaviruses: 
MHV (Skinner and Siddell, 1983, 1984; Armstrong et al., 1983, 1984; Parker and 
Masters, 1990; Kunita et al., 1992; Decimo et al., 1993), IBV (Boursnell et al., 
1985; Sutou et al., 1988; Williams et al., 1992), TGEV (Kapke and Brian, 1986; 
Rasschaert et al., 1987; Britton et al., 1988), bovine coronavirus (BCV) (Lapps et 
al., 1987; Cruciere and Laporte, 1988), HCV-OC43 (Kamahora et al., 1989), 
HCV-229E (Schreiber et al., 1989; Myint et al., 1990], feline infectious peri- 
tonitis virus (FIPV] (Vennema et al., 1991), turkey coronavirus (TCV) (Verbeek 
and Tijssen, 1991), canine coronavirus (CCV) (Horsburgh et al., 1992; Vennema 
et al., 1992), porcine epidemic diarrhea virus (PEDV) (Bridgen et al., 1993), and 
sialodacryoadenitis virus (SDAV) (Kunita et al., 1993). 
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The general physical properties of the N proteins are markedly similar 
(Table I). The encoded polypeptides range from 377 to 455 amino acids in length. 
Notably, MHV, SDAV, BCV, TCV, and HCV-OC43, which belong to the same 
antigenic cluster, have N proteins roughly 10% larger than those of the remain- 
der of the family. Also striking is the presence in PEDV NN protein of an 
approximately 40-residue-long stretch that has no counterpart in the other 
members (Bridgen et al., 1993]. This unique sequence, particularly rich in 
serine, arginine, and asparagine residues, is located in the central portion of the 
molecule (see Fig. 1) and might reflect a recombinational event or a stuttering of 
the viral polymerase since it exhibits some periodicity. The virion-associated N 
protein of coronaviruses resolves by sodium dodecyl sulfate-polyacrylamide 
gel electrophoresis (SDS-PAGE] as a single species with molecular weight rang- 
ing from 45 to 63 kDa, depending on the virus and on the strain (for references, 
see Siddell et al 1982]. These apparent sizes are significantly larger than the 
molecular weights calculated from sequence information, possibly due both to 
anomalous distributions of amino acid compositions and to phosphorylation of 
the proteins. Even among five closely related strains of MHV, molecular weight 
differences of up to 3 to 5 kDa have been observed (Cheley et al 1981], although 
the calculated molecular weights for these species are almost identical (Parker 
and Masters, 1990). 

Coronavirus N proteins are highly basic, having an excess of at least 12 
lysine and arginine residues over aspartate and glutamate residues. One mea- 
sure of this is seen in the calculated overall isoelectric points (pIs) of these 
polypeptides, which fall in the range 10.3-10.7 (Table I). Basic amino acids are 
not clustered in strings as in proteins such as simian virus 40 (SV40) T antigen 
or the capsid proteins of alphaviruses, but local densities of positive charge can 
be found, particularly in two loci in the middle of the N molecule (Fig. 1). In 
marked contrast to the bulk of the molecule, the carboxy-terminus of each 
coronavirus N protein is quite acidic, as indicated by pIs ranging from 4.3 to 5.5 
calculated for the 45 carboxy-terminal amino acids (Table I). Another notewor- 
thy feature of the N proteins is that they all have a relatively high serine content 
(7—1196] (Table I). These potential targets for phosphorylation (see Section V) are 
distributed throughout the molecule, but many are interspersed in the first of 
the two basic regions mentioned above. It is interesting that most of the above- 
cited characteristics of coronavirus N proteins are also shared by the nucleocap- 
sid (NP) proteins of influenzaviruses (Table I], perhaps due to an ancestral 
relationship between these two families. 

In spite of the general similarities indicated for the coronavirus N proteins, 
there exists only alow degree of sequence homology among them. For example, 
the N protein of BCV shows an overall sequence homology of 29% with the N 
protein of TGEV or of IBV; the sequence homology of the N proteins of MHV 
and BCV, which belong to the same antigenic cluster, is 70%, i.e., notably less 
than between the M proteins (Lapps et al., 1987). An optimal alignment of the 11 
sequences reveals about 30 residues common to all, including 8 glycines and 3 
prolines; in contrast, the few cysteine residues present in the sequences are not 
at conserved positions. An exception to this overall lack of N protein homology 
is a region of roughly 50 amino acids, falling within the amino-terminal third of 
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each molecule, which contains a relatively high degree of sequence identity 
(Fig. 2). Such a marked degree of conservation may signal an important function 
common to the N proteins. The strong proportion of aromatic residues in this 
segment possibly participate in stabilizing N binding to RNA via ring stacking 
interactions (Ollis and White, 1987]; alternatively, the conserved segment may 
be involved in other hydrophobic interactions not directly related to RNA 
binding. 

To date, two coronavirus N protein mutants have been described, both in 
MHV-AS59. Each is thermolabile as well as temperature-sensitive (ts), forming 
small plaques at the nonpermissive temperature. The first of these, Alb] (Mas- 
ters et al., 1994], has a single critical amino acid change within the highly 
conserved region mentioned above, as well as a closely linked amino acid 
change that is phenotypically silent. The second, Alb4 (Koetzner et al., 1992), 
contains a 29-amino-acid deletion of a putative spacer region that separates the 
carboxy-terminal domain of the N protein from the remainder of the molecule. 
In addition to these, the ts209 mutant of MHV-A59 (Koolen et al., 1983) is 
potentially an N protein mutant (see Section IV) but has not yet been mapped. 
Further study of these mutants may provide new insights into N protein struc- 
ture and function. 








MHV-A59 102 |c v wl yfr] wisls s[F]K T P p[|e o q x o[z]r P 
SDAV 102 |e x w|v|n|g N/R/R S|FIK 7 p p|clo o x oitilt P 
BCV 99 IG v wly|R|H N|R|R S|F|K T A DIG|N o R Q/LIL P 
TCV 99 |e v wly|R|H NIRIR S|F|K T A DIGIN Q R olur P 
Hcv-0c43 99 |e v w|v|n|u N/RIG s[r]x T A DiGIN o R Q|L|L P 
TGEV ep le v win[R(Q TIR|- - Y RM v KIGIQ RK E|LIP E 
CCV ep |G v wuN|Rn|o TIRI- - Y RM v KIGIR RK E|[L/P E 
FIPV 68 |e y wiu[s]o I]/R|- - Yx RIV KIGIQRKE/LIAE 
HCV-229F 56 |G v s |N vo kx|n|- -[r R T R K]G/K R v Diuls P 
PEDV 51 levy WINE QI|RI- -WRMRRIGIERIEQPS 
IBV 69 |e v w/R[rR]o a|n]- -[z] x p eo RiG]o RK PV p D 
R wyj YYLGTGPHAlLGaASyGopDs1eEfe v| [W v al 152 
RWIYIFYYLGTGPHAIGASFGDS1E|G v| |w v à| 152 
RWw|YFYYLG”TGPHAKDQY G rp:ri[po vir|w v a| 149 
RúWu|YFYYLGTGPHAKDQY GT TDICLD G vl|r|w v A| 149 
RwW|YFYYLGTGPHAKDQY G TD)DILMDcCvvy[lw v A| 149 
RWIFIF YYLGTGPHAIDAKFKODKULDGvi|viwval 116 
RW|FIFYYLGTGPHAIDAKFKQKLIDGvi|viwval 116 
kK[(wr|r y[F]LGtGPHAIPDAKFKODKI|DGVIFIW Val 116 
KLHIFYYLGTGPHKDAKFRERVE|G v|viw v al 105 
N(wlHIF vvrnoTvTcPHlopnLRYxRTRTSEjo v|riw val 99 
A|w|v|r v v[T]|a m G P DL NW GD TOD oli v|w val 117 





FIGURE 2. Region of greatest homology among coronavirus N proteins. Numbers indicate the 
amino acid residues of each N protein shown in the alignment. Boxed residues are those which are 
identical in at least seven of the eleven sequences. The sources of the sequence data are given in the 
text. 


148 HUBERT LAUDE AND PAUL S. MASTERS 


IV. SYNTHESIS OF N PROTEIN AND NUCLEOCAPSID 
FORMATION 


The nucleocapsid protein appears to be the most abundant viral polypep- 
tide in coronavirus-infected cells during all stages of infection. IN protein is 
readily detected from 3 to 5 hr after infection onward, at the same time or earlier 
than the other structural proteins, and its synthesis is maintained throughout 
the virus cycle (Anderson et al., 1979; Bond et al., 1979; Holmes et al., 1981; 
Siddell et al., 1981b; Stern et al 1981; Rottier et al., 1981b; Stohlman et al 1983; 
Garwes et al., 1984; Talbot et al., 1984; Keck et al., 1988; Dea and Tijssen, 1989; 
Simkins et al 1989). In latently infected Schwannoma cells, MHV-JHM N 
protein was shown to be produced for as long as 7 days and beyond in the 
absence of infectious virus production (Coulter-Mackie et al., 1985). In cells 
infected with MHV-JHM defective-interfering RNAs, the level of N synthesis 
was found to be not significantly altered throughout multiple passages, in 
contrast to the other viral polypeptides (Makino et al 1985). Thus, it seems 
that a strong and permanent expression of N gene product accompanies both 
productive and nonproductive coronavirus infection. 

Coronavirus N protein is synthesized on free polysomes (Niemann et al., 
1982), but appears to become rapidly associated to intracellular membranes 
(Stohlman et al., 1983). Its subcellular location has been examined by immuno- 
fluorescence labeling of MHV-, TGEV-, and TCV-infected cells. Anti-N mono- 
clonal antibodies produce a bright particulate fluorescence which tends to 
distribute evenly in the cytoplasm (Robb and Bond, 1979; Laude et al., 1986; 
Holmes et al., 1987; Welch and Saif, 1988; Dea and Tijssen, 1989). Intriguingly, 
two groups of investigators have reported an important concentration of N-an- 
tigen within the nuclei of MHV-infected cells (Robb and Bond, 1979; Holmes et 
al., 1987). The MHV-A59 ts mutant Alb4, shown to have a short deletion in its 
N gene (Koetzner et al., 1992), exhibited an altered distribution of N antigen in 
the cytoplasm and the nucleus (Holmes et al., 1987). Translocation of N into the 
nucleus might reflect the ability of the protein to bind to nucleic acids in a 
nonspecific manner (see Section VI), or it may indicate the fortuitous presence 
of a nuclear translocation signal in this highly basic protein. However, nuclear 
translocation cannot be essential for productive infection since MHV is capable 
of replicating in enucleated cells (Brayton et al., 1981; Wilhelmsen et al., 1981]. 

Studies on the biosynthesis of coronavirus nucleocapsid proteins in in- 
fected cells have revealed the presence of one major intracellular form, desig- 
nated N, and of several N-related polypeptides. The major N species has been 
shown to comigrate with virion-associated N protein in denaturing gels (Siddell 
et al., 1981b; Rottier et al., 1981b; Laude et al., 1986; Dea and Tijssen, 1989) and 
to be phosphorylated (MHV: Siddell et al., 1981a; Stohlman et al., 1983; TGEV: 
Garwes et al., 1984), basic (Siddell et al. 1981b), and heterogeneously charged 
(Bond et al., 1979). The identity between intracellular and virion N has been 
further established by tryptic peptide fingerprinting in the case of MHV-JHM 
and MHV-A59 (Siddell et al., 1980; Bond et al., 1984). N polypeptide is a primary 
gene product, as demonstrated by pulse-chase analysis of cells infected by 
MHV-JHM and MHV-A59 (Siddell et al., 1981b; Rottier et al., 1981b; Holmes et 
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al., 1981; Perlman et al 1986/87), MHV-3 (Cheley and Anderson, 1981), TGEV 
(Laude et al., 1986], and BCV (Deregt et al., 1987). In agreement with this are the 
results of cell-free translation experiments, where N-assigned mRNA isolated 
from cells infected by MHV (Rottier et a1.,1981a; Siddell et al., 1980), TGEV 
(Jacobs et al., 1986), or FIPV (De Groot et al., 1987) programmed synthesis of a 
polypeptide species of the same size as the virion-associated protein. The same 
appears to hold true for recombinant N proteins expressed independently of 
other viral components in mammalian cells using vaccinia or adenovirus vec- 
tors (Pulford and Britton, 1990; Vennema et al., 1991; Stohlman et al., 1992, 
Wesseling et al., 1993) or in yeast cells (Britton et al., 1988). 

Several groups of investigators have mentioned the existence of one or 
more intracellular polypeptides with a slightly lower molecular weight (2-5 
kDa less) than the major N species, sometimes designated N', N", etc. These 
latter have been observed in immunoprecipitated or immunoblotted material, 
but also in direct lysates from MHV-]HM and MHV-A59, MHV-3, TGEV, BCV, 
IBV, and TCV, i.e., most of the coronavirus-host cell systems studied (Cheley 
and Anderson, 1981; Stern et al., 1981; Rottier et al., 1981b; Bond et al., 1984; 
Garwes et al., 1984; Coulter-Mackie et al., 1985; Robbins et al 1986; Keck et 
al., 1988; Welch and Saif, 1988; Dea and Tijssen, 1989). The relationship be- 
tween N and these faster migrating species has been confirmed by peptide 
fingerprinting in the case of MHV (Cheley and Anderson, 1981; Bond et al 1984} 
and IBV (Stern et al., 1981). Both the N and N’ species found in TGEV grown in 
LLC-PKI cells were phosphorylated (Garwes et al., 1984). 

While there is general agreement that only the full-length N species is 
incorporated into virions, it is as yet unclear whether the N subspecies result 
from a different level of phosphorylation, proteolytic degradation at preferential 
cleavage sites, premature termination, or translation of deleted transcripts 
arising from defective particles. It has been proposed that the 57-kDa species of 
MHV-JHM corresponds to a nonphosphorylated precursor; a phosphorylated 
species of 60kDa became apparent only after a 10-min chase and the ratio of 
57:60 kDa decreased as infection proceeded (Stohlman et al., 1983). In contrast, 
several groups of investigators reported that the N subspecies are made in 
increasing amounts during the course of infection (Anderson et al., 1979; 
Holmes et al., 1987; Talbot et al., 1984). Furthermore, pulse-chase experiments 
with both MHV-JHM and MHV-3 provided strong evidence that the lower 
molecular weight species are derived from the major N species (Anderson et al., 
1979; Cheley and Anderson, 1981). Comparison of immunoblots of MHV-A59- 
or MHV-JHM.-infected cell extracts done with antibody to full-length N or with 
antibody to a synthetic peptide of the carboxy-terminus led to the conclusion 
that the intracellular faster-migrating species lacked the carboxy-terminal do- 
main (Holmes et al., 1987). Similarly, the subspecies observed following vaccinia- 
vectored expression of TGEV N protein in two different cell system were both 
unreactive toward an anti-N antibody directed to the C-terminal end (Pulford 
and Britton, 1990]. 

The virus strain and/or the host cell might determine the number, size, and 
amount of N subspecies. Thus, cells infected by the MHV-A59 mutant ts209 
have been found to contain an extra N-related band (Koolen et al., 1983). On the 
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other hand, an extra band of N was detected in LLC-PKI1 but not in thyroid 
TGEV-infected cells (Garwes et al., 1984], and the N subspecies found in LLC- 
DEI and CVI cells expressing recombinant TGEV N protein differed in size 
(Pulford and Britton, 1990}. In several virus-cell systems, N subspecies were not 
observed (MHV-JHM: Siddell et al 1980; Niemann et al 1982; TGEV: Laude et 
al., 1986; BCV: Deregt et al., 1987). From the above observations it would appear 
that the N subspecies most likely represent molecules partially degraded by 
cellular or serum proteases, although this was not formally proved in any of the 
published data. Such a phenomenon, which may be of no biological signifi- 
cance, is reminiscent of the limited proteolytic cleavage of the nucleocapsid 
protein of paramyxoviruses (Mountcastle et al., 1974) and toroviruses (Hor- 
zinek et al., 1985). Faster-migrating N species have also been observed during in 
vitro mRNA translation and were assumed to be the result of a proteolytic 
degradation during immunoprecipitation (MHV: Rottier et al., 1981a; FIPV: De 
Groot et al., 1987) or of premature termination (MHV: Leibowitz et al., 1982). In 
addition, various truncated species of N protein have been characterized in the 
brain tissue from MHV-JHM- infected mice (Talbot et al 1984) and in immune 
complexes present in the body fluids of FIPV-infected cats (Horzinek et al., 
1986). 

There is limited information about the fate of N protein in infected cells. 
Studies performed with MHV indicated that only a very small fraction of N is 
chased out of the cell into mature virions even after a 6-hr chase period (Holmes 
et al., 1981), suggesting that a large pool of free N protein is built up early in the 
infection. MHV-A59- or MHV-JHM- infected cells appeared to contain a sub- 
stantial amount of free nucleocapsids; intracellular EDTA-resistant structures 
of 200—230S, which contained N protein and genomic RNA, were identified 
(Robb and Bond, 1979; Spaan et al., 1981). Perlman et al. (1986/87) determined 
the rate at which N protein was transferred into such nucleocapsid structures in 
the absence of de novo protein synthesis. The pool of free N protein in 
cycloheximide-treated cells was found to decay with a half-time of approx- 
imately 1 hr, which is, for example, quite slow compared with rhabdoviruses. 
Nearly all the genomic RNA was found to be incorporated in nucleocapsids 
(Spaan et al., 1981; Perlman et al., 1986/87). 


V. N PHOSPHORYLATION 


Phosphorylation is the only known posttranslational modification of 
coronavirus N proteins (Stohlman and Lai, 1979; Lomniczi and Morser, 1981; 
King and Brian, 1982; Garwes et al., 1984), and it is also another point of 
resemblance between the nucleoproteins of coronaviruses and influenza vi- 
ruses (Kistner et al., 1989). MHV N protein is phosphorylated in the cytoplasm 
within about 10 min of its synthesis and it concomitantly becomes tightly 
associated with a cell membrane fraction (Stohlman et al., 1983). N protein 
represents the major phosphopolypeptide in both MHV- and TGEV-infected 
cells (Siddell et al., 1981a; Garwes et al., 1984). For MHV, the phosphate linkage 
has been shown to be exclusively to serine residues (Stohlman and Lai, 1979; 
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Siddell et al 1981a). Even though the exact number and location of the phos- 
phoserines are still undefined, the process appears to be highly specific. An 
high-pressure liquid chromatography (HPLC) analysis of the tryptic phospho- 
peptides of MHV N revealed that phosphorylation may occur at only two or 
three sites (Wilbur et al 1986), a number relatively small in comparison to the 
30 to 40 potential target residues in the N proteins. Moreover, the two plaque 
morphology variants of MHV-JHM examined in this study were found to differ 
in their phosphorylation patterns. It remains unclear whether the process in- 
volves autophosphorylation or a protein kinase activity of either viral or host 
origin. For MHV, a virion-associated protein kinase activity has been described 
that can transfer additional phosphate from ATP to the N protein as well as to 
other substrates in vitro. This enzyme appeared to be cyclic AMP-independent 
and to require divalent cations (Siddell et al 1981a), features common to those 
of protein kinases identified in a variety of other enveloped viruses (Tan, 1975). 
Whether the same enzyme is responsible for the kinase activity in vivo, how- 
ever, has not yet been shown. 

The finding that some anti-MHV N monoclonal antibodies recognized the 
phosphorylated form but not a nonphosphorylated, faster-migrating species 
assumed to be its precursor, led to the proposal that phosphorylation may 
induce a drastic conformational change of the protein (Stohlman et al., 1983), 
possibly relevant to its biological function. However, the question of the role 
of N protein phosphorylation is largely unanswered. It has been speculated by a 
number of workers that it may govern the tightness of the association between 
N and RNA, but there is as yet no direct evidence that bears on this possibility. 
MHV N proteins derived from purified virions, cellular membranes, and the 
cytosol of infected cells were reported to have similar phosphopeptide maps 
(Wilbur et al., 1986), a finding in agreement with the observed rapidity of the 
phosphorylation process. On this basis, phosphorylation might not be expected 
to play a critical role in the regulation of virus assembly. Finally, it has been 
hypothesized that phosphatase-catalyzed nucleocapsid dephosphorylation might 
control the processes of coronavirus penetration and uncoating, and potentially 
relevant host enzyme activities have been sought. An acid phosphatase activity 
associated with the particulate fraction of oligodendrocytes has been shown to 
convert the MHV N protein to a lower molecular weight product (Beushausen 
et al., 1987). Also, a neutral, serine-threonine phosphatase activity was detected 
in the endosomal fraction of L-2 murine fibroblasts and in other susceptible 
cells, which recognized MHV N protein as a more specific substrate than acid or 
alkaline phosphatases (Mohandas and Dales, 1991}. The relevance of such data 
would certainly be greater if the site of fusion of entering coronavirus particles 
were known more precisely. 


VI. N PROTEIN BINDING TO RNA 


The most noteworthy function of coronavirus N proteins is that they bind 
to the RNA genomes of their respective viruses. Neither the nature nor the 
mechanism of this N-RNA complex formation has been well elucidated. Anal- 
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ogies with other RNA-binding proteins are limited. Coronavirus N proteins do 
not contain zinc finger motifs (Klug and Rhodes, 1987) or sequences similar to 
the RNA consensus and RNA-binding motifs found in many cellular RNA- 
binding proteins such as nucleolin, poly(A]-binding protein, and various hnRNP 
and snRNP proteins (Dreyfuss et al., 1988). With respect to the properties of 
other RNA viruses containing helical nucleocapsids, coronaviruses appear to 
most closely resemble the orthomyxoviruses. The N (or NP) proteins of both of 
these families form complexes with RNA that tend to dissociate in high con- 
centrations of salt and that provide only limited protection against the action of 
ribonucleases (Bukrinskraya, 1973; Macnaughton et al., 1978; Robbins et al., 
1986; Kingsbury et al 1987). This contrasts markedly with the N-RNA com- 
plexes of the rhabdoviruses and paramyxoviruses, which are stable in high salt 
and are markedly ribonuclease-resistant (Bukrinskraya, 1973; Leppart et al., 
1979). 

Efforts to characterize coronavirus N-RNA binding have sought to address 
two fundamental questions. First, what RNA sequences or structures are spe- 
cifically bound by N? Second, what parts of the N molecule participate in RNA 
recognition? The ability of the MHV N protein to bind to nucleic acids has been 
examined by an RNA overlay protein blot assay (ROPBA) in which electro- 
phoretically separated proteins from purified virions or from MHV-infected 
cells were transferred to nitrocellulose and probed with labeled RNA (Robbins 
et al., 1986). Monomeric N as well as a minor multimeric N component, 
possibly an N trimer, were the only virus-specific RNA-binding proteins de- 
tected in this manner. Nucleic acid sequence binding specificity was not dem- 
onstrated, however, since N bound equally well to MHV RNA or to single- or 
double-stranded RNA or DNA of heterologous origin. 

Subsequent adaptation of the ROPBA procedure allowed the demonstra- 
tion of sequence-specific binding of RNA by MHV N protein (Stohlman et al., 
1988). The inclusion of a large molar excess of unlabeled {uninfected} cellular 
RNA with the labeled probe of MHV-infected cellular RNA was found to 
compete out low-affinity binding of RNA by both N and cellular proteins. 
Synthetic RNA probes containing defined extents of the 5' leader RNA adjacent 
to either mRNA 1 or mRNA 6 were then used to localize a sequence-specific, 
high-affinity binding site for N to the region of nucleotides 56 to 67 of the MHV 
genome. This conclusion was reinforced by the observation that an anti-N 
monoclonal antibody could selectively immunoprecipitate from MHV-infected 
cells all leader-containing RNA species greater than or equal to 65 nucleotides 
in length (Baric et al, 1988). Since the leader sequence is contained in all 
subgenomic positive-strand RNAs, as well as in the genome, this may suggest 
that some recognition event other than binding by N protein mediates the 
selective incorporation of genome RNA into assembled virions. 

The assignment of the RNA-binding property of N to one or more particu- 
lar portions of the N molecule is at present incomplete. The detection by 
ROPBA of the species N' and N", presumed to be products of small proteolytic 
truncations, suggests that at least limited removal of one of the two termini of 
N does not alter RNA binding (Robbins et al., 1986). Recently, this point has 
been investigated using a nondenaturing gel assay of MHV N protein translated 
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in a rabbit reticulocyte lysate programmed with engineered synthetic mRNAs 
(Masters, 1992). In this system, full-length N protein was shown to bind to an 
endogenous RNA species in the reticulocyte lysate. Construction of a set of N 
gene deletions allowed the mapping of this RNA-binding activity to the central 
one of three domains of N protein proposed previously on the basis of sequence 
comparisons of different strains of MHV (Parker and Masters, 1990). Deletions 
entering the central domain abolished this non-sequence-specific RNA-binding 
activity. Sequence-specific RNA binding, as measured by ROPBA, was also 
mapped to the central portion of purified MHV N protein following its fragmen- 
tation by treatment with formic acid (Nelson and Stohlman, 1993). The isolated 
RNA-binding domain, comprising amino acids 169 to 308, retained the ability 
to interact specifically with its target RNA sequence. Considerable work re- 
mains to be done to delineate the minimal RNA-binding domain of N and to 
define the exact nature of the association between this molecule and RNA. 
Evidence recently has been presented that MHV N protein binds specifi- 
cally to cell membranes (Anderson and Wong, 1993]. The binding was inhibited 
by nonviral RNA and DNA, as well as by various membrane phospholipids such 
as cardiolipin, a property shared by certain DNA binding proteins. This obser- 
vation led the authors to speculate that membrane lipid association of the N 
protein may compete for RNA binding sites on the N protein, a mechanism 
possibly relevant to the processes of nucleocapsid uncoating and assembly. 


VII. N-N AND N-M PROTEIN INTERACTIONS 


Very little information is presently available about the protein-protein 
contacts N makes in assembled virions or in infected cells. The structure of the 
coronavirus nucleocapsid suggests that, at the very least, there must be some 
form of interactions between N monomers that neighbor each other upon each 
successive turn of the helix. There may also be an interaction between adjacent 
N monomers bound along the RNA strand, although these would not neces- 
sarily be required for helical encapsidation. 

It has been observed for some coronaviruses that SDS-PAGE, under non- 
reducing conditions, detected an N-related protein of 140-160kDa, present at a 
level of a few percent compared to N. This species was identified by immuno- 
blotting of BCV, HCV-OC43, and MHV virions (Hogue et al., 1984; Robbins et 
al., 1986; Deregt et al., 1987], and for MHV it was demonstrated to bind to RNA 
(Robbins et al., 1986). The N multimer appeared to be the size of a trimer, and it 
was shown to be held together by disulfide linkages, since it disappeared in the 
presence of mercaptoethanol. It is possible that this species is a fundamental 
unit of the nucleocapsid structure. However, since the N protein of each of 
these three viruses contains just two cysteine residues separated by only four 
amino acids, it seems likely that other types of N-N binding must also contrib- 
ute to stabilizing the helix. 

The coronavirus M protein is thought to be the principal determinant of 
virion assembly and budding. Consequently, it might be expected that there 
exist N-M interactions that play a major role in viral structure, although little 
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experimental evidence bears on this issue. During the development of techniques 
to fractionate and purify viral components, it was found that a temperature- 
dependent binding of M and the nucleocapsid occurred in NP40-disrupted MHV 
virions (Sturman et al., 1980). At 4?C, M protein was solubilized by the non- 
ionic detergent, but at ambient temperature or 37°C, M remained associated 
with the nucleocapsid, cosedimenting in sucrose gradients. However, the same 
study demonstrated that NP40-solubilized M was able to bind to RNA in the 
absence of N protein; thus, it is not clear whether the M-nucleocapsid binding 
being examined was due to an N-M interaction. Clearly, it would be desirable 
to examine the N-binding properties of the cytosolic carboxy-terminal segment 
of M in isolation from the remainder of the M molecule. 


VIII. POTENTIAL ROLE OF N PROTEIN IN RNA SYNTHESIS 


Various lines of evidence indicate a possible role for N protein in corona- 
virus RNA synthesis. An earlier, perplexing observation in this regard was the 
finding that, after multiple passages, MHV defective-interfering (DI) particles 
inhibited the synthesis of all normal viral RNA species except RNA7, which 
was translated abundantly into N protein (Makino et al., 1985). Although the 
mechanism of this selective resistance is unknown, it may imply that N is 
required for MHV RNA synthesis, or at least for the synthesis of DI RNA. 

More directly, it has been shown that antibodies to N protein almost totally 
inhibited viral RNA synthesis in an in vitro system prepared from MHV- 
infected cells, whereas, under the same conditions, no significant inhibition 
was seen with anti-M or anti-S antibodies (Compton et al., 1987). Moreover, the 
product RNA synthesized in this system, which was mostly of genome length, 
was found to be encapsidated by N protein. This might suggest that coronavirus 
RNA synthesis is coupled to the encapsidation of nascent RNA, analogous to 
the replication of viruses with helical negative-strand RNA nucleocapsids. For 
coronaviruses, however, the requirement for N may also apply to the synthesis 
of subgenomic RNAs. The immunoprecipitation of all leader-containing MHV 
RNAs, as well as replicative intermediate RNA complexes by an anti-N mono- 
clonal antibody, has been taken to mean that the N-leader RNA interaction 
must be important to the mechanism of leader-primed transcription (Baric et 
al., 1988). 

The analogy with some families of negative-strand RNA viruses may be 
inexact in another respect. The protein synthesis inhibitor cycloheximide has 
been shown to prevent or inhibit genomic and subgenomic RNA synthesis in 
MHV-infected cells (Sawicki and Sawicki, 1986], and this effect was mirrored by 
puromycin or cycloheximide in the in vitro RNA-synthesizing system (Comp- 
ton et al., 1987). For vesicular stomatitis virus (VSV), a rhabdovirus, the neces- 
sity for continuous protein synthesis in order to sustain RNA replication is due 
solely to a requirement for VSV N protein (Patton et al., 1984). However, for 
MHV-infected cells it was shown that, even in the presence of cycloheximide, 
substantial pools of soluble free N protein were available for the encapsidation 
of nascent RNA (Perlman et al., 1986/87). Thus, synthesis of N may not be rate- 
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limiting for coronavirus RNA synthesis. Given the complexity and enormity of 
the putative coronavirus polymerase and the uniqueness of the mechanism of 
coronavirus RNA synthesis, much needs to be done to clarify how N protein 
participates in this process. 


IX. ANTIGENIC PROPERTIES 


The nature of antigenic relatedness between the nucleocapsid proteins of 
different coronaviruses was first investigated using polyclonal sera raised 
against complete or subviral particles. The presence of common antigenic 
determinants on the N protein as well as on the other structural polypeptides of 
closely related viruses such as TGEV, FIPV, and CCV has been demonstrated. 
For instance, anti- IGEV sera have been shown to react with FIPV N and CCV N 
to about the same extent as with TGEV N in immunoblotting or enzyme-linked 
immunosorbent assay (ELISA) tests (Horzinek et al., 1982; Have et al., 1992). 
Similarly, antibodies directed against BCV N protein recognize HCV-OC43 N 
and MHV-A59 N (Hogue et al., 1984). However, it would appear that cross- 
reactivity at the level of the N protein is not restricted to members belonging 
to the same antigenic cluster. Thus, HCV-229E RNP could be detected by 
immunodiffusion using antisera to MHV, TGEV, or HEV (Yassen and Johnson- 
Lussenburg, 1978). An antigenic relationship was also observed between HCV- 
229E and MHV-3 using heterologous anti-RNP antisera but not antisera against 
virion external components (Hasony and Macnaughton, 1982). A two-way 
cross-reaction between the N proteins of FIPV and PEDV has been evidenced 
using both blotting and immunoprecipitation tests, whereas the viral envelope 
proteins exhibited no heterologous reactivity (Yaling et al 1988). These authors 
also obtained preliminary evidence of a cross-reactivity between the N proteins 
of IBV and MHV and of FIPV and HEV. Finally, cross-reactivity has been re- 
ported for the N proteins of TCV and IBV (Dea et al., 1990) and of TGEV and 
PEDV (Have et al., 1992). These observations suggest that a few common 
determinants may exist on the nucleocapsid antigen that are suitable for the 
identification of many or all members of the family. 

The N protein appears to be a substantially conserved antigen according to 
studies examining the reactivity of monoclonal antibodies (MAbs] toward dif- 
ferent strains of the same virus. The N protein of MHV exhibits a moderate 
degree of antigenic variation; several determinants common to up to 11 strains 
or isolates have been identified (Fleming et al., 1983; Talbot and Buchmeier, 
1985}. In both of these studies, the antigenic polymorphism of N protein was 
found to be less extensive than that of S protein. The first authors divided the 
MHV strains into two groups according to the relative binding of MAbs to N 
and proposed that such variation was correlated with their pathogenesis, 
whereas the second found that antigenic variation of S protein was a better 
correlate. The IBV N protein may also be subjected to antigenic variation since 
no fully conserved epitope could be detected within a panel of Australian field 
strains (Ignjatovic and McWaters, 1991). By contrast, N protein from different 
strains of TGEV exhibited little or no significant qualitative or quantitative 
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antigenic divergence (Laude et al., 1986; Welch and Saif, 1988; Sanchez et al., 
1990). Similarly, FIPV N epitopes were generally highly conserved, although a 
few type-specific epitopes were discriminated (Fiscus and Teramoto, 1987]. 
Thus, the interstrain relatedness is closer for TGEV and FIPV than in the case of 
MHV, as is also reflected by a lesser variation of S antigen. Finally, a proportion 
of common epitopes have been identified in viruses showing a close antigenic 
relationship, like TGEV and FIPV or BCV and TCV (Fiscus and Teramoto, 1987; 
Sanchez et al., 1990; Dea et al., 1990). 

The identification of B cell epitopes as well as their localization on the N 
primary structure is less documented than for the S protein. Epitope mapping 
by competitive assays using a small panel of anti-MHV N or anti-TGEV N 
MAbs delineated at least two nonoverlapping determinants (Talbot et al., 1985). 
A translation product obtained from an intracellular DI RNA of MHV-JHM and 
encoding the 89 C-terminal amino acids of N protein was found to be immuno- 
precipitated by an anti-N MAb (Makino et al., 1988). Four subfragments alto- 
gether covering the whole MHV-JHM N protein sequence were individually 
able to elicit an antibody response and reacted with sera from diseased Lewis 
rats, thus indicating that B cell epitopes are distributed throughout the entire 
length of the molecule (Wege et al., 1993). Western blotting immunoscreening 
of a series of bacterial fusion products has allowed a more detailed antigenic 
analysis of the TGEV N protein (Martin Alonso et al., 1992). Seven of eleven 
MAbs recognized the amino-terminal half of the polypeptide chain. Inter- 
estingly, one epitope was localized within a 60 amino acid stretch that essen- 
tially overlaps the most conserved region of the coronavirus N protein (see Fig. 
1). Asecond antigenic domain was delineated within the carboxy-terminal half, 
to which the heterogeneity reported for the N protein of PRCV relative to TGEV 
appears to be restricted. Additionally, an epitope 11 amino acids long has been 
localized very near the carboxy-terminus; this well-defined epitope has been 
used as a portable marker allowing detection of various fusion proteins (Parra et 
al., 1989). Taken together, the above observations suggest that a major epitope 
situated in the carboxy end might be a common feature of the coronavirus N 
proteins. Detergent-resistant epitopes have also been detected but not mapped 
in the case of MHV-JHM, BCV, and IBV N proteins (Talbot et al., 1984; Deregt 
and Babiuk, 1987; Ignjatovic and McWaters, 1991). 

Due to its internal position in the virus particle, the N protein might not be 
expected to bear neutralization-mediating determinants. Indeed, none of the 
coronavirus anti-N MAbs isolated thus far exhibited significant neutralizing 
activity. On the other hand, no B cell epitopes externally exposed at the cell 
membrane could be demonstrated in the case of MHV-JHM and TGEV N 
proteins (Collins et al., 1982; Laude et al., 1986; Laviada et al., 1990; Pulford and 
Britton, 1990). It has been claimed that one MAb specific for MHV-3 N protein 
reacted with the surface of infected cultured cells and neutralized viral cyto- 
pathic effect in the presence of complement, suggesting that at least one epitope 
of N protein is accessible to antibodies on the cell membrane (Lecomte et al., 
1987). Of possible relevance to this point is the apparent protective activity 
conferred by certain anti-N MAbs when transferred to mice subsequently le- 
thally infected by MHV-2 or MHV-3 (Nakanaga et al., 1986; Lecomte et al., 
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1987). This paradoxical observation parallels the well-documented protection 
by anti-N Ab to rabies (Lodmell et al 1993). 

Attention has been drawn only recently to the T-cell determinants poten- 
tially expressed on coronavirus antigens. Two MHC class II (I-E4)-restricted 
murine T-cell hybridomas generated after immunization with IBV have been 
shown to be responsive to the N protein (Boots et al 1991a). This response was 
strain-specific, thus confirming an antigenic variation of the protein among the 
different serotypes. Both the antigenic determinants were mapped within the 
region spanning amino acids 71 to 78 by using recombinant expression products 
and synthetic peptides. Furthermore, the epitope was shown to prime cellular 
immune response to IBV in the chicken (Boots et al 1991b). Studies on MHV- 
JHM-infected Lewis rats allowed the demonstration of an early and strong 
T-helper cell response specific for the N protein. N-specific CD4+ T-cell lines 
were established and shown to confer protection against acute encephalitis 
upon adoptive transfer to otherwise lethally infected animals (Körner et al., 
1991). The carboxy-terminally located, 95-residue-long fragment of N expressed 
as a bacterial fusion protein was shown to induce the most pronounced re- 
sponse and to mediate protection (Wege et al., 1993). Finally, the MHV-JHM 
strain has been reported to elicit an immunodominant anti-N protein cytotoxic 
T lymphocyte (CTL) response in BALB/c mice (Stohlmann et al., 1992). Indeed, 
among 21 CD8* T-cell lines derived from animals undergoing an acute de- 
myelinating encephalitis, 17 were found to recognize the N protein as a target 
(Stohlman et al., 1993). By using a combination of truncated forms of N ex- 
pressed by vaccinia virus recombinants and a series of overlapping peptides, the 
response has been mapped to an epitope comprising residues 316 to 330, which 
contains a described L4 binding motif (Bergmann et al., 1993). Despite a natural 
sequence variation affecting two residues of the motif, a cross-reactivity of the 
JHM N-specific CTL with six other MHV strains was observed. Altogether, the 
above studies lend support to the view that the N protein may be an important 
antigen with respect to both helper and cytotoxic immune response. 


X. CONCLUSION 


The N proteins have occupied a more peripheral position thus far in the 
study of coronaviruses. A substantial fraction of the papers cited in this chapter 
have been concerned with N as an entity tangential to the principal focus of the 
study in hand. Thus, our information about this protein is incomplete, and the 
answers to many important questions remain obscure. An understanding of the 
organization of the coronaviral nucleocapsid is of utmost importance for a 
complete elucidation of how the virus expresses its genetic information once it 
reaches the cytoplasm. The persistent association of N protein with the genome 
RNA, and possibly with the subgenomic RNAs, suggests that exploring the 
roles of N in coronavirus transcription and translation will be a fertile area of 
research for years to come. Techniques recently developed have taken advan- 
tage of the high rate of RNA recombination that occurs during coronavirus 
replication in order to engineer site-specific N gene mutations into the genome 
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of MHV. It is expected that this approach will greatly facilitate the elucidation 
of N protein structure function relationships in the future. It should also be 
possible to extend this methodology to the study of other genes and other 
members of the coronavirus family. 
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CHAPTER 8 


The Coronavirus 
Hemagglutinin Esterase 
Glycoprotein 


Davip A. BRIAN, BRENDA G. HOGUE, 
AND THOMAS E. KIENZLE 


I. DISCOVERY OF THE HEMAGGLUTININ ESTERASE PROTEIN 


Early reviews on coronavirus structure (Siddell et al., 1983a,b; Sturman and 
Holmes, 1985) described coronaviruses as having three major structural pro- 
teins: a large-surface [or peplomer) glycoprotein of around 200 kDa, a phos- 
phorylated nucleocapsid protein of around 50 kDa, and a glycosylated, multi- 
spanning membrane protein of around 30 kDa. This description was based 
primarily on studies of the prototypic avian infectious bronchitis virus (IBV) 
and the highly studied mouse hepatitis virus (MHV|, strain A59. Although IBV 
was shown early on to have a weak hemagglutinating property, detection of the 
hemagglutinating activity required that the virus first be treated with phospho- 
lipase C or concentrated by centrifugation in sucrose gradients (Bingham et al., 
1975). Not all strains of IBV demonstrated hemagglutination, however, and 
hemagglutinating activity by IBV, as well as by the porcine transmissible gastro- 
enteritis virus (TGEV] (Noda et al., 1987, 1988) was probably a cryptic property 
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of the spike protein (Cavanagh and Davis, 1986). The nature of this hemaggluti- 
nating activity is not well understood. 

Several coronaviruses, unlike those described above, have been shown to 
have a strong hemagglutinating property. The activity requires no pretreatment 
of virus. These are the bovine coronavirus (BCV] (Sharpee et al., 1976), the 
hemagglutinating encephalomyelitis virus of swine (HEV) (Callebaut and Pen- 
saert, 1980), human coronavirus OC43 (HCV OC43) (and OC38 and other close 
relatives) (Kaye and Dowdle, 1969), the turkey coronavirus (TCV) (Dea and 
Tijssen, 1988), and the MHV-related diarrhea virus of infant mice (DVIM] (Sugi- 
yama and Amano, 1980, 1981; Sugiyama et al., 1986). Interestingly, in all cases 
where it has been examined, these viruses show a second, shorter fringe of 
peplomers by electronmicroscopy (Bridger et al., 1978; Sugiyama and Amano, 
1981]. 

The first report showing the possibility of a fourth major structural protein 
for the strongly hemagglutinating coronaviruses was that by Callebaut and 
Pensaert for HEV (1980). A glycoprotein of 140 kDa was described that reduced 
to subunits of 76 kDa after treatment with 2-mercaptoethanol. The gradient- 
purified 140-kDa glycoprotein appeared to have hemagglutinating properties. 
The character of the fourth major structural protein was more clearly shown by 
King et al. for the bovine coronavirus (King and Brian, 1982; King et al., 1985). 
The 140-kDa glycoprotein was not removed from BCV by bromelain [whereas 
the surface protein was) and BCV retained its ability to hemagglutinate mouse 
erythrocytes. The property of hemagglutination was therefore associated with 
the 140-kDa glycoprotein. This association was further supported by four other 
observations. (1) Monospecific, polyclonal rabbit antiserum against the BCV 
hemagglutinin failed to identify a homologous protein on the nonhemag- 
glutinating MHV A59 virion, an otherwise close antigenic relative of BCV 
(Hogue et al., 1984]. (2) Monoclonal antibodies to the BCV 140-kDa protein 
inhibited hemagglutination by BCV (in addition to showing neutralizing activ- 
ity) (Deregt and Babiuk, 1987; Deregt et al., 1989]. (3) The presence of the 140- 
kDa protein on HCV OC43 correlated with hemagglutinating activity during 
protease digestion experiments (in a pattern similar to that for BCV) (Hogue and 
Brian, 1986). (4) The dimeric 140-kDa protein has been found on all of the 
strongly hemagglutinating coronaviruses studied to date. 


II. DEDUCED AMINO ACID SEQUENCE AND PROPERTIES OF 
THE BOVINE CORONAVIRUS HEMAGGLUTININ ESTERASE 
PROTEIN 


Extensive studies demonstrating the pattern of synthesis, assembly, and 
glycosylation of the hemagglutinin protein (Deregt et al., 1987; Hogue et al., 
1989) were done prior to knowledge of the hemagglutinin gene sequence or 
knowledge of its acetyl esterase activity. These are discussed here, however, in 
the context of the deduced amino acid sequence for the protein. 

The gene encoding the hemagglutinin esterase (HE) protein maps on the 5' 
side of the spike protein gene. This was predicted from the size of a mRNA 
(species 2b) that was present in BCV, but not in MHV A59 or some strains of 
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MHV-JHM, and from the 3' nested-set pattern of coronavirus mRNAs (Abra- 
ham et al., 1990; Keck et al., 1988; Siddell, 1983). Interestingly, the [defective] 
gene encoding the HE protein was first sequenced for the nonhemagglutinating 
MHV A59 (Luytjes et al 1988). Examination of the gene sequence revealed 
three important features. (1) The MHV-A59 genome contains a nonfunctional, 
apparently vestigial, hemagglutinin gene with no initiation codon for transla- 
tion of the HE open reading frame (ORF). This protein, as demonstrated by 
Hogue et al. (1984), is not made in MHV-A59-infected cells. (2) There is no 
UCUAAAC consensus intergenic sequence of the type thought to play a role in 
the initiation of transcription immediately upstream of the HE ORF. (3) Within 
the deduced HE amino acid sequence there is an F-G-D-S sequence identical to 
the acetyl esterase active site on the HE molecule of influenza C (Luytijes et al., 
1988]. 

Acetyl esterase activity, which functions to destroy a virus receptor (dis- 
cussed in Section VI), was first demonstrated for the BCV hemagglutinin 
(Vlasak et al., 1988a). The hemagglutinin was therefore renamed the hemagglu- 
tinin esterase protein. 

Some strains of MHV-JHM express a homologous 65-kDa protein (Makino 
et al., 1983; Siddell, 1981; Taguchi er al., 1986). However, MHV-JHM and also, 
for example, MHV-S isolates, hemagglutinate, if at all, only very poorly (Shieh 
et al., 1989; Sugiyama et al., 1986; Talbot, 1989; Yokomori et al., 1989). The 
reasons for this are unclear, as the cloned MHV-JHM HE gene, expressed using a 
vaccinia virus/T7 RNA polymerase system, encodes a protein that dimerizes, 
has acetyl-esterase activity, and is able to hemadsorb rat erythrocytes (Pflei- 
derer et al., 1991). | 

Studies on the synthesis and glycosylation of the BCV HE protein estab- 
lished that it is synthesized as a monomeric apoprotein of approximately 42 
kDa that becomes cotranslationally glycosylated and rapidly disulfide-linked 
(Deregt et al., 1987; Hogue et al., 1989). Only asparagine-linked glycosylation is 
found, with six to seven sugar chains of either the high mannose or hybrid type 
and three to four chains of the complex type attached per subunit, as deduced 
from a final subunit size of 65 kDa. Since coronaviruses assemble by budding 
into transitional elements occurring between the endoplasmic reticulum and 
pre-Golgi (Tooze et al., 1988), the maturation of HE carbohydrates to complex 
forms must take place on the virion during migration through the exocytic 
pathway to the cell surface. 

The deduced amino acid sequence of the HE protein, as determined from 
cDNA clones of BCV genomic RNA (Fig. 1], shows it to be a 47,700-Da apopro- 
tein of 424 amino acids (Kienzle et al., 1990; Parker et al., 1989; Zhang et al., 
1991). There are two very hydrophobic regions. The first encompasses the first 
18 amino acids and has the properties of a signal peptide. Amino-terminal 
amino acid sequencing of the processed virion HE protein identifies a phenyl- 
alanine, indicating that the first 18 amino acids indeed function as a signal 
peptide for membrane translocation (Hogue et al., 1989). The deduced molecu- 
lar weight of the nonglycosylated protein without its signal peptide is 45 kDa. 
The second hydrophobic region extends from amino acid 390 through 415 and is 
known to serve as the membrane anchor for HE (Kienzle et al 1990; Parker et 
al., 1990]. Therefore, the resulting HE is a type I glycoprotein with an amino- 
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FIGURE 1. Sequence of the BCV Mebus strain HE gene and deduced amino acid sequences of the 
HE, IORF 1, and IORF 2 proteins. The nucleotide sequence begins 15 bases upstream from the 
presumed start codon of the HE gene [8,694 bases from the 3' end of the genome, excluding the 
poly(A) tail] at the CTAAAC consensus sequence (doubly underlined) and ends with the ATG 
initiation codon of the spike protein gene. The amino-terminal signal peptide and presumed 
carboxy-terminal anchor sequences are underlined. The amino terminal signal cleavage site is 
indicated by an arrow (T). The putative esterase active site (F-G-D-S] and potential N-linked 
glycosylation sites (NXS or NXT, where X # P) are boxed. The CTAAAC consensus sequence 
preceding the spike protein gene is also doubly underlined. The GenBank accession number of the 
nucleotide sequence is M31052. 


terminal ectodomain and a carboxy-terminal anchor with nine amino acids 
residing on the cytoplasmic (intravirion) side of the membrane. There are eight 
potential N-glycosylation sites on the ectodomain of the protein. The F-G-D-S 
putative esterase-active site is at amino acid positions 19 through 22 of the 
processed protein. There are 13 cysteine residues in the processed subunit, but 
it is not known which of these are used in interchain disulfide bonding to form 
the dimer or intrachain folding. 

The BCV HE gene contains within it two nonoverlapping ORFs, one (IORF 
1) that could encode a 136 amino acid protein of 15,741 Da, and another (IORF 2) 
that could encode an 83 amino acid protein of 9,514 Da (Fig. 1). It is not known 
whether these are expressed during infection. The existence of these proteins is 
questionable since both are very hydrophobic throughout their length and there 
are no large internal ORFs within the HE gene of MHV-JHM or the HE pseudo- 
gene of MHV A59 [from our analysis of data in Luytjes et al. [1988] and Shieh et 
al. (1989]]. 

Expression of the cloned BCV HE gene, both in vitro and in vivo, has 
established that the HE protein alone contains all the necessary information for 
disulfide-linked homodimerization, membrane translocation, anchorage, trans- 
port to the cell surface as an integral membrane protein, erythrocyte-binding 
activity, and esterase activity (Kienzle et al., 1990; Parker et al., 1990; Yoo et al., 
1992). Cell surface expression of HE may explain hemadsorption by infected 
cells, although other explanations such as binding by nonreleased surface- 
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adherent viruses have been given (Bucknall et al., 1972; Kapikian et al., 1972). 
Cells infected with an adenovirus recombinant expressing the BCV HE exhibited 
both hemadsorption and esterase activities, both of which could be blocked by 
monoclonal antibodies having neutralizing activity (Parker et al 1990, Yoo et 
al., 1992). 


III. COMPARISON OF THE HEs OF BCV, HCV-OC43, MHV, AND 
INFLUENZA C VIRUS 


A remarkable conservation of amino acid sequence between the deduced 
HE proteins of MHV A59 (Luytjes et al., 1988) and influenza C virus (Nakada et 
al., 1984) suggested a common evolutionary origin for these proteins (Luytjes et 
al., 1988). A similarly high degree of homology among the HEs of influenza C 
virus, BCV, and HCV-OC43 [Parker et al., 1989; Kienzle et al., 1990; Zhang et al., 
1992) and a striking conservation in the position of the putative acetylesterase 
active site and several of the cysteine residues (Fig. 2) further supported the idea 
of a common origin for the HE proteins. Although the HEs of BCV and HCV 
OC43 are quite similar to each other, they as a group appear to be no closer to 
influenza C virus HE in sequence similarity than does MHV-JHM HE (Fig. 2), 
suggesting they are equidistant in evolutionary time. Because of the ability of 
coronaviruses to undergo recombination at a high frequency (Lai et al., 1987), it 
is conceivable that during coinfection with influenza C virus, a coronavirus 
(e.g., HCV OC43} could have undergone a similar recombination to give rise to 
the HE gene in the coronavirus genome (Luytjes et al., 1988). One major inter- 
esting difference between the HEs of BCV and influenza C is that the influenza 
C HE is formed by the proteolytic cleavage of a large precursor (Herrler et al., 
1981), whereas the BCV HE is not. 

Interestingly, the HE of influenza C virus (Nakada et al., 1984) with which 
BCV HE shows 2996 amino acid sequence homology does form disulfide-linked 
dimeric structures (Herrler et al., 1981). Comparative structural analyses might 
also suggest the site of the hemagglutinating (cell binding) domain, assuming it 
is common among the various HEs. 


IV. MODEL OF THE BCV HE ON THE VIRION 


From evidence of the membrane orientation and dimeric structure of BCV 
HE, we have drawn a tentative model for the dimeric protein (Fig. 3). Placement 
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FIGURE 2. Alignment of the deduced amino acid sequences for the HE genes of MHV (JHM), BCV 
(Mebus), HCV-OC43, MHV-JHM, and influenza C virus (C/Cal/78). The MHV (Shieh et al., 1989] 
and BCV (Kienzle et al., 1990), HCV-OC43 (Zhang et al., 1992), and influenza C (Nakada et al., 1984] 
sequences were each aligned using the PileUp program in the Wisconsin GCG Sequence Analysis 
Package. The sequences begin with the signal peptide of each protein. The influenza C sequence 
contains all of the HA1 sunbunit and the first 26 amino acids of the HA2 sunbunit. Gaps (identified 
by dots) were introduced to maximize alignment. Aligned cysteine residues in all four sequences are 
boxed. The esterase site (FGDS) is boldfaced. 
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FIGURE 3. Model of the BCV HE protein. Identical chains representing the 65-kDa subunits 
(without N-terminal signal peptides) are shown as disulfide-linked monomers. Open circles repre- 
sent hydrophilic amino acid residues (Asp, Glu, Lys, Arg, His, Asn, Gln), dotted circles represent 
intermediate amino acid residues (Gly, Ser, Thr, Tyr, Try, Pro), and solid circles represent hydro- 
phobic amino acid residues (Ala, Val, Leu, Ile, Phe, Cys, Met). Amino acids were classified by the 
scale of Kyte and Doolittle (1982). Amino acid position numbers are shown (beginning with the N 
terminus of the signal peptide, not shown). Disulfide bridges between cysteine residues are repre- 
sented by connected solid circles. The potentially glycosylated Asn residues in the ectodomain are 
identified by open diamonds. Triangles identify the putative active site of the esterase (F-G-D-S). 
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of disulfide bonds and protein loops is purely arbitrary but done in a way to 
maximize exposure of the hydrophilic, sugar-containing regions and the F-G- 
D-S esterase active site. It remains to be determined whether higher-order 
noncovalent oligomeric forms of the dimeric HE exist. 


V. HEMAGGLUTINATING ACTIVITY AND ITS POSSIBLE ROLE 
IN VIVO 


It has been proposed that HE may function as a second attachment protein 
that, in addition to the spike protein, serves to initiate infection (Parker et al., 
1989; Vlasak et al., 1988a). This proposal was based primarily on the following 
observations: (1) The process of hemagglutination itself is a cell-binding process 
that may mimic cell binding during infection. (2) The HE homologue on influ- 
enza C virus was known to be an attachment protein that recognizes 9-O- 
acetylated sialic acid-containing cell receptors (Herrler et al 1985; Herrler and 
Klenk, 1987; Rogers et al., 1986; Vlasak et al., 1987, 1989]. (3) A panel of 
monoclonal antibodies to the BCV HE protein exhibited neutralization of infec- 
tion in cell culture and in the calf (Deregt et al 1989). With these antibodies, 
four neutralizing epitopes could be identified on the HE molecule (Deregt and 
Babiuk, 1987). 

The idea that HE may function as a second attachment protein seemed to 
be further strengthened by the striking correlation between its presence and the 
neuronotropism expressed by the virus (Table I). Targeting of the HE-containing 
coronaviruses to the brain of experimentally infected mice was demonstrated 
for BCV (Akashi et al., 1981; Kubo et al., 1982; Barthold et al., 1990), HCV OC43 
(Pearson and Mims, 1983; Barthold et al., 1990), and HEV (Yagami et al., 1986, 
1993). Except for the restricted growth of HE-less MHV A59 in neurons of the 
subthalamic nucleus and substantia nigra of mice (Fishman et al., 1985) and the 
neuronotropism demonstrated by the HE-less MHV3 (Talbot, 1989; Tardieu et 
al., 1986), it is particularly striking that coronaviruses possessing HE are neu- 


TABLE I. Neuronotropism of HE-Containing Coronaviruses Cell Tropism 
in Central Nervous System 





Coronavirus  Hemagglutinin form Astrocytes Oligodendrocytes Neurons References 





MHV A59 No HE + + —4 b 
MHV-JHM Dimer + + b-e 
DVIM Dimer ? ? ? f-h 
BCV Dimer ? ? + i 
HCV OC43 Dimer = + + j-l 
HEV Dimer ? + m-o 
TCV Dimer ? ? ? p 


"There is restricted growth in neurons of the subthalamic nucleus and substantia nigra in mice (Fishman et al., 
1985). 

References: b, Dubois-Dalcq et al. (1982); c, Knobler et al. (1981); d, Parham et al. [1986]; e, Taguchi et al. (1986), 

f, Sugiyama and Amano (1980); g, Sugiyama and Amano (1981); h, Sugiyama et al. (1986); i, Kubo et al. (1982), 

j, Pearson and Mims (1983); k, Pearson and Mims (1985); 1, Barthold et al. (1990); m, Andries and Pensaert (1981); 

n, Yagami et al. (1986); o, Yagami et al. (1993); p, Dea and Tijssen (1988). 
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ronotropic in either the natural infection (e.g., HEV) or during experimental 
infection in mice (Table I). One possible explanation for this neuronotropism 
might be the high concentration of 9-O-acetylated sialic acids on brain ganglio- 
sides (Haverkamp et al., 1977; Herrler and Klenk, 1987). It will be important to 
determine whether the neuronotropism exhibited by some, but not all, strains 
of MHV-JMH (Dubois-Dalcq et al., 1982; Fleming et al., 1987; Goto et al., 1986; 
Knobler et al., 1981; Massa et al., 1988; Morris et al., 1989; Parham et al., 1986; 
Taguchi et al., 1986; Wege et al., 1982) is a function of the variability in expres- 
sion of the HE protein (Shieh et al., 1989). One recent study showed that JHM- 
infected mice were protected when passively immunized with HE-specific 
nonneutralizing monoclonal antibodies, suggesting that this protein does play a 
role in infection by MHV-JHM (Yokomori et al., 1992). 

Recently the potential role of HE as an attachment protein for initiating 
infection has required reevaluation since the peplomeric spike proteins on BCV 
have also been shown to bind 9-O-acetylated sialic acid, the receptor determi- 
nant for BCV infection, and to cause hemagglutination (Kinkel and Herrler, 
1993; Schultze et al., 1991a; Schultze and Herrler, 1992; Storz et al., 1991). 
Whether the S protein of BCV also recognizes proteins similar to the carcino- 
embryonic antigen as does MHV or the aminopeptidase N protein as does HCV 
229E or TGEV (see Chapter 4, this volume] remain to be determined. But the 
fact that S binds to the same receptor determinant as HE would suggest that 
they both use the same receptor protein to initiate infection, if in fact they both 
do initiate infection. In a study where separately purified S and HE proteins 
from BCV were used to measure hemagglutination, a difference in the agglutin- 
ability of erythrocytes from different animal species was found (Schultze et al., 
1991a,b). Both S and HE were able to agglutinate mouse and rat erythrocytes; 
however, only S could agglutinate chicken erythrocytes. This was thought to 
reflect a more powerful binding by S since the chicken erythrocytes have fewer 
surface sialic acid molecules. These observations have led Herrler and co- 
workers to suggest that S and HE recognize the same receptor determinant, but 
that S is the important molecule for initiating infection (Schultze, 1991a,b). HE, 
it was proposed, may function primarily as a receptor-destroying enzyme to 
remove glycoconjugates that act as false receptors. 


VI. ESTERASE ACTIVITY AND ITS POSSIBLE ROLE IN VIVO 


The enzymatic activity associated with the HE is neuraminate-O-acetyl- 
esterase, which destroys the receptor determinants on erythrocytes bound by 
BCV, HCV OC43, and HEV (Vlasak et al., 1988a,b). The receptors therefore 
contain 9-O-acetylated sialic acid and the activity is a serine esterase indis- 
tinguishable from the esterase of influenza C virus (Herrler et al., 1985, 1988; 
Vlasak et al, 1987, 1988a,b). The only other viruses possessing receptor- 
destroying activity are the orthomyxovirus types A and B and the paramyxo- 
viruses, for which the receptor-destroying enzyme is neuraminidase. 

Enzymatic association with the hemagglutinin molecule was shown in 
studies using diisopropyl fluorophosphate (Vlasak et al., 1988a), which binds to 
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the serine in the active site and acts as a serine esterase inhibitor (Muchmore 
and Varki, 1987). Diisopropyl fluorophosphate completely inhibited the re- 
ceptor-destroying activity of BCV, and 3H-diisopropyl fluorophosphate treat- 
ment of BCV with subsequent sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis of the protein revealed that the 140-kDa hemagglutinin mole- 
cule was specifically labeled. Presence of the F-G-D-S esterase-active site in the 
amino acid sequence of the hemagglutinin protein (Kienzle et al 1990; Parker 
et al., 1989) and the existence of esterase activity on HE expressed from the 
cloned gene (Parker et al., 1990) confirmed this association. The studies by 
Vlasak et al. (1988a] suggested that the active esterase may be required for either 
endocytosis of the virus and/or uncoating with subsequent release of the virus 
into the cytoplasm during the initial stages of infection. This notion comes 
from the fact that diisopropyl fluorophosphate treatment of BCV destroyed the 
esterase activity and at the same time reduced the infectious titer of the virus 
100- to 400-fold. This treatment, however, did not prevent binding of the virus 
to sialic acid-containing receptors. The acetyl esterase, however, may act at the 
end of the replication cycle by releasing progeny virus from infected cells. In 
this way it would behave analogously to the neuraminidase of influenza A virus 
(Palese et al., 1974). 


VII. CONCLUDING REMARKS 


Many interesting features of HE expression, structure, and function remain 
to be characterized. It remains to be established, for example, what genetic 
mechanisms cause variable expression of HE in some strains of MHV. One 
hypothesis still needing direct proof is that it is the number of UCUAA repeat 
units at the 5’ end of the HE gene that regulates transcription (Shieh et al 1989). 
It also remains to be shown whether the two ORFs within the BCV HE gene 
(IORF 1 and IORF 2) are expressed during virus replication, and, if so, what the 
functions of their products might be. Is their function related to those of HE? 

Regarding HE structure, it remains to be determined where the intra- 
molecular and intermolecular disulfide bridges occur in the dimeric molecule, 
which asparagine residues become glycosylated, and the exact nature of the 
carbohydrate modifications. Detailed mapping will be required to identify the 
sites on the HE molecule involved in hemagglutination, cell binding (if differ- 
ent from hemagglutination), recognition by neutralizing monoclonal anti- 
bodies (Deregt and Babiuk, 1987), and interaction (if any) with other structural 
and nonstructural proteins. Do higher-level noncovalent multimeric forms of 
HE exist? Ultimately it will be important to determine the three-dimensional 
structure of HE and establish the antigenic epitopes on this structure. 

The full range of functions of HE in virus replication remains to be deter- 
mined. Further studies are needed to test whether HE is, in fact, required for 
replication in those viruses that express it constituitively. Is it responsible for 
expanded tissue tropism or host range in those viruses that possess it? If so, by 
what mechanism?! Is it responsible for the strong neuronotropism exhibited by 
the HE-containing coronaviruses? If so, then special importance should be 
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placed on understanding its role since the HE-containing HCV OC43 (and its 
close relatives} is a common human respiratory pathogen and has the potential 
for involving the CNS in human disease. The precise role(s} of the esterase in 
the infection process remains to be determined. 

Finally, the usefulness of HE as a vaccine immunogen for the control of 
BCV-induced gastrointestinal disease should be explored. The existence of HE 
as an integral membrane protein on the surface of infected cells (Kienzle et al., 
1989), as well as a viral structural protein that can induce neutralizing anti- 
bodies (Deregt et al., 1989), suggests that both cellular and humoral immune 
responses are important in protection against BCV. BCV-neutralizing anti- 
bodies can be induced in mice inoculated with an adenovirus recombinant 
expressing HE (Yoo et al., 1992], raising the possibility that protective mucosal 
immunity can be induced by HE. The usefulness of HE in vaccines given by the 
mucosal route should be assessed. 
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CHAPTER 9 


The Small-Membrane Protein 


STUART G. SIDDELL 


I. INTRODUCTION 


Recently, it has become evident that, in addition to the S, M, and N proteins, the 
coronavirus genome encodes a further structural protein, the small-membrane 
or sM protein (Liu and Inglis, 1991; Godet et al 1992; Yu et al 1994). In this 
chapter, I shall describe the structure and expression of the sM protein and 
consider its possible function. Although the available information is modest, it 
already seems clear that the sM protein may have a more important role in the 
biology of coronaviruses than has been recognized previously. 


II. STRUCTURE 


In the coronavirus genomes that have been analyzed todate, the sM protein 
gene is invariably located upstream and adjacent to the M protein gene. The sM 
protein genes encode polypeptides with predicted molecular weights between 
9,100 and 12,400. An optimal alignment of the amino acid sequences of the sM 
proteins of bovine coronavirus (BCV), murine hepatitis virus (MHV|, porcine 
transmissible gastroenteritis virus (TGEV), canine coronavirus (CCV), human 
coronavirus 229E (HCV 229E|, porcine epidemic diarrhea virus (PEDV), and 
avian infectious bronchitis virus (IBV) reveals a number of conserved features 
that are illustrated in Fig. 1. 


e A hydrophobic region located in the amino-terminal half of the polypep- 
tide. This region is flanked by negatively charged residues (amino- 
terminus] and positively charged residues (carboxy-terminus). 
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e A cysteine-rich region adjacent to the hydrophobic domain. 

e A conserved proline residue near the middle of the polypeptide. 

e With the exception of IBV, conserved tyrosine residues in the carboxy- 
half of the polypeptide. 

e An abundance of charged residues in the carboxy-half of the polypep- 
tide. 


These features are indicative of an integral membrane protein and a mem- 
brane location for the sM protein, both in the infected cell and in the virion, has 
been demonstrated (see Section V). The distribution of charged residues on 
either side of the hydrophobic region predict a Nexo-Cendo orientation (Hart- 
mann et al., 1989), but the available experimental data are consistent with a 
Cexo-Nendo configuration. Thus, monoclonal antibodies, specific for epitopes 
located in the C-terminus of the TGEV sM protein, were able to induce cell- 
surface fluorescence in paraformaldehyde-fixed, TGEV-infected swine testicle 
(ST) cells [Godet et al., 1992). Further experiments are needed to assign a 
definitive transmembrane orientation. 

At present, there is no evidence to suggest that the coronavirus sM protein 
is glycoslylated or phosphorylated. However, Yu et al., (1994) have suggested 
that the MHV sM protein is posttranslationally acylated. These authors noted a 
small difference in the electrophoretic mobility of the MHV-A59 sM protein 
synthesized either in infected L2 cells or by in vitro translation of a synthetic 
sM mRNA. This difference was abolished by treatment of the in vivo synthe- 
sized protein with 1 M hydroxlamine at pH 8.0. These data are consistent with 
the linkage of palmitic acid to the sM protein, and it seems reasonable to 
suggest that the cysteine residues located adjacent to the hydrophobic domain 
of the protein may be the sites of esterification. It should be noted, however, 
that Godet et al. (1992) were unable to detect the incorporation of palmitic acid 
chains in a recombinant TGEV sM protein expressed in insect cells. An sM 
protein gene has not been identified in the Berne virus (BEV) genome. 


III. EXPRESSION 


The mRNAs that encode the sM proteins of coronaviruses are listed in 
Table I. In the cases of HCV 229E, PEDV, TGEV, and CCV, these assignments 
are based mainly on colinearity of the genomic sM open reading frame (ORF) 


TABLE I. sM Protein mRNAs 








Virus mRNA ORFs Expression Reference 

IBV à 3a, 3b, 3c (SM) Tricistronic Liu et al. (1991) 

MHV 5 5a, 5b (sM) Bicistronic Budzilowicz and Weiss (1987) 
HCV 229E 5 sM Monocistronic Raabe et al. (1990) 

BCV 5.1 sM Monocistronic Abraham et al. (1990) 

PEDV 4 sM Monocistronic Duarte et al. (1994) 

TGEV 4 sM Monocistronic Rasschaert et al. (1987) 

CCV 4 sM Monocistronic ` Horsburgh et al. (1992) 
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and the 5' unique region of the mRNA, and by extrapolation of the coronavirus 
translational "model," which, in fact, was deduced by the analysis of other 
mRNAs (Rottier et al., 1981; Leibowitz et al., 1982; Siddell, 1983; Stern and 
Sefton, 1984; Jacobs et al., 1986; de Groot et al., 1987). In general, the sM 
mRNAs contain only one ORF in their 5' unique region, and, in these cases, the 
mRNAs almost certainly function monocistronically. Accordingly, they may 
now be called sM mRNAs. There are, however, two exceptions. First, the 
mRNA 5 of MHV contains two ORFs (5a and 5b) in its unique region, and 
second, the mRNA 3 of IBV contains three ORFs (3a, 3b, and 3c) in its unique 
region. In both cases, the sM gene is the 3' proximal ORF In these two special 
cases, there is experimental evidence that the mRNAs function bi- or tri- 
cistronically, respectively. 


IV. IN VITRO STUDIES 
A. MHV mRNA 5 


The MHV mRNA 5 contains two ORFs in its unique region. These ORFs 
overlap by five nucleotides. The upstream ORF, ORF 5a, is predicted to encode a 
basic protein of 12,500-13,000 molecular weight. The function of this protein is 
unknown, and it has not yet been detected in MHV-infected cells. With the 
exception of the AUG codon initiating the downstream ORF, no internal AUG 
codons are found in the sequence covered by ORF 5a. The downstream ORF, 
previously designated as ORF 5b, encodes the sM protein (Skinner et al., 1985; 
Budzilowicz and Weiss, 1987). The in vitro translation of synthetic RNAs 
transcribed from cDNAs containing both ORF 5a and the sM ORF led Budzilo- 
wicz and Weiss (1987) to conclude that the MHV-A59 mRNA 5 is functionally 
bicistronic. 

If the MHV mRNA 5 is functionally bicistronic, then expression of the sM 
ORF has to involve the internal initiation of protein synthesis. Two models can 
be proposed. First, as originally suggested on the basis of the structural data 
alone (Skinner et al 1985), a mechanism involving leaky ribosome scanning 
(Kozak, 1989) is possible. Alternatively, and especially in the light of recent data 
on the translation of picornavirus RNAs (Meerovitch and Sonenberg, 1993), a 
cap-independent mechanism involving ribosome entry at an internal position 
on the MHV mRNA 5 has to be considered. 

Recently, Thiel and Siddell (1994) have analyzed the in vitro translation of 
synthetic mRNAs that contain the unique region of MHV (strain JHM) mRNA 
5 preceded by an ORF derived from the B-galactosidase gene of Escherichia coli. 
These experiments show that the B-galactosidase ORF is an effective barrier to 
the movement of ribosomes from the 5' end of the mRNA but, nevertheless, the 
sM ORF is efficiently translated. The authors conclude that the translation of 
the sM ORF is mediated by the internal entry of ribosomes. In vivo studies 
using reporter gene constructs and the naturally occurring mRNA 5 are re- 
quired to strengthen this conclusion. However, if correct, it will be great inter- 
est to examine interactions between the putative MHV mRNA 5 "internal 
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ribosome entry site" (IRES) and trans-acting factors that, directly or indirectly, 
mediate the initiation of protein synthesis at the sM ORF. 


B. IVB mRNA 3 


The unique region of the IBV mRNA 3 contains three ORFs. The upstream 
ORFs, ORF 3a and ORF 3b, encode proteins of 6700 and 7400 molecular weight, 
respectively. The function of these polypeptides is unknown, although it has 
been shown that they are expressed in IBV-infected chick kidney (CK) cells (Liu 
et al., 1991). Again, it seems noteworthy that, with the exception of the AUG 
that initiates the 3' proximal ORF, there are no internal AUG codons in the 
sequences covered by ORFs 3a and 3b. The 3' proximal ORE, previously desig- 
nated ORF 3c, encodes the sM protein (Boursnell et al., 1985). In vitro transla- 
tion studies using synthetic mRNAs containing the 3a, 3b, and sM ORFs 
strongly suggest that all three proteins can be translated from a single molecular 
species, indicating that the IBV mRNA 3 is functionally tricistronic (Liu et al, 
1991). 

As with the MHV mRNA 5, there is evidence that the IBV sM ORF of 
mRNA 3 is expressed by a mechanism involving the internal entry of ribo- 
somes. Thus, Liu and Inglis (1992), again using synthetic mRNAs and in vitro 
translation, were able to show that a tricistronic mRNA whose peculiar 5’ end 
structure prevents the translation of the 5’ proximal ORFs (3a and 3b) directs 
the synthesis of sM normally. Moreover, the translation of sM, unlike that of 3a 
and 3b, was insensitive to the presence of the 5' cap analogue, 7-methyl GTP, 
andit was unaffected by alteration of the sequence contexts for initiation on the 
3a and 3b ORFs. 

The same authors (Liu and Inglis, 1992) were able to show that an mRNA, 
in which the IBV 3a/3b/sM ORFs were placed downstream of the influenza A 
virus nucleocapsid protein gene, directed the efficient synthesis of sM, as well 
as nucleocapsid protein, whereas initiation at ORFs 3a and 3b could not be 
detected. Furthermore, expression of the sM ORF from this construct was 
abolished when the 3a and 3b regions were deleted. This suggests that this 
region contains a putative IRES element responsible for sM expression. 

In a theoretical paper, Le et al. (1994) performed Monte Carlo simulations 
of RNA folding in the unique region of the IBV mRNA 3 and determined that a 
region of significant folding occurred prior to the initiation codon of the sM 
ORF. This region encompassed 265 nucleotides within the ORF 3a and ORF 3b 
coding regions. By computer modeling, the authors were then able to predict, 
within this region, a tertiary structure containing five highly significant RNA 
stem-loops modeled into a compact superstructure by the interaction of two 
pseudoknots. A further interesting feature of this model is the presence of 
putative base complimentarity between unpaired nucleotides in the mRNA 
together with 18S ribosomal RNA. This interaction would be located imme- 
diately upstream of the sM ORF initiation codon. The authors point out that 
these are features also found in their predictions of tertiary structure for picor- 
navirus IRES elements (Le et al., 1992, 1993) 
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The translation mechanism of the monocistronic sM mRNAs of corona- 
viruses has not been studied in any detail (see, however, Abraham et al., 1990). 
In some cases, for example, the HCV 229E sM mRNA, there is a lengthy 5' NTR 
that could, at least theoretically, encompass an IRES element. In other cases, for 
example, the PEDV sM mRNA, the 5' NTR is predicted to be relatively short. 


V. IN VIVO STUDIES 


Only a small number of studies have convincingly demonstrated the ex- 
pression of the sM protein in coronavirus-infected cells and its incorporation 
into virion particles. The standard approach has been to construct bacterial 
fusion proteins containing sM protein sequences and to generate monoclonal 
antibodies or polyclonal sera which are then used in immunoprecipitation (IP], 
immunocytochemistry, or immunofluorescence (IF). 

First, Smith et al. (1990) used an sM-fusion protein specific polyclonal 
serum to immunoprecipitate an approximately 12,000 molecular weight pro- 
tein (the expected size for the IBV sM polypeptide is 12,400) from infected CK or 
Vero cells. As judged by cell fractionation and IP, the sM protein was associated 
with a large membrane fraction of the cell. IF studies demonstrated a juxta- 
nuclear location with the transport of at least some sM protein to the cell 
surface. These conclusions were essentially confirmed by Liu et al. (1991). 

Liu and Inglis (1991) subsequently demonstrated, for the first time, the 
association of the IBV sM protein with the virion envelope. Using the same 
polyclonal antiserum, these authors were able to immunoprecipitate the sM 
protein from highly purified radiolabeled preparations of IBV. Moreover, the sM 
protein was found to cofractionate with the other virion envelope proteins (S 
and M) upon detergent disruption of virus particles. The molar ratios of the 
S:N:M:sM proteins in purified IBV virions was calculated to be approximately 
1:11:10:2. 

Second, Godet et al. (1992), using monoclonal antibodies generated by 
immunization with a recombinant sM protein expressed in insect cells, were 
able to detect a 10,000 molecular weight protein (the predicted size of the TGEV 
sM polypeptide is 9200) in TGEV-infected porcine (PD5) cells. Indirect IF 
showed that the subcellular location of the sM protein was juxtanuclear and, 
again, a proportion of sM molecules were detected at the cell surface. Further- 
more, incorporation of the sM protein into TGEV virions was detected by the 
analysis of purified, radiolabeled virus and the sM protein could be immuno- 
precipitated from the virion membrane fraction using monoclonal antibodies. 
The molar ratios of the TGEV sM:S:M proteins were calculated to be 1:20:300. 

Third, Leibowitz et al. (1988), using an MHV sM-fusion protein specific 
polyclonal antiserum were able to detect expression of the sM protein in MHV- 
infected L2 cells by immunocytochemistry, or IP followed by one- or two- 
dimensional polyacrylamide gel electrophoresis. Subsequently, Yu et al. (1994) 
were able to show that the MHV sM protein is located in the virion envelope. In 
these studies, a radiolabeled protein, with the same molecular weight as the sM 
protein immunoprecipitated from MHV infected L2 cells, could be detected in 
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highly purified MHV virions. Additionally, these authors were able to show 
that the rabbit antiserum was able to neutralize viral infectivity in the presence 
of complement. The IF staining of MHV-infected cells with two goat antipep- 
tide antisera also revealed that the sM protein is membrane associated and, at 
least partially, transported to the cell surface. Similar IF data have been ob- 
tained for BCV-infected cells, using the MHV sM specific antiserum (Abraham 
et al., 1990). 


VI. FUNCTION 


There are no published experimental data on the function of the corona- 
virus sM protein. However, it appears to be essential because, in contrast to 
some other coronavirus genes [e.g., ns2 of MHV; Schwarz et al. (1990]], there 
have been no reports of viable coronaviruses with a defective sM gene. Never- 
theless, almost all authors have speculated on a possible role for the protein, 
namely, in the assembly of virions. This is because small integral membrane 
proteins have been described in several enveloped viruses, including alpha- 
viruses, influenza viruses, and paramyxoviruses (Garoff et al., 1980; Welch and 
Sefton, 1980; Gaedigk-Nitschko and Schlesinger, 1990; Lamb et al., 1985; 
Hiebert et al., 1985; Olmsted and Collins, 1989) and they have been implicated 
in the assembly process. Thus, for example, site-directed mutagenesis studies 
on Semliki Forest virus have shown that the sM (6 kDa) protein probably has a 
role in virus assembly and the budding process (Liljstróm et al., 1991) 

Recently, a system has been developed that may provide important insights 
into the coronavirus assembly process. This system is based on the cellular 
coexpression of coronavirus structural proteins, using a T7/vaccinia virus- 
based format, and the detection of viruslike particles in the culture supernatant 
(H. Venemma, Utrecht and E. Bos, Leiden, personal communication). Prelimi- 
nary results suggest that the sM protein may, indeed, have an important role in 
particle assembly and/or budding. 
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CHAPTER 10 


The Coronavirus 
Nonstructural Proteins 


T. D. K. BROWN AND I. BRIERLEY 


I. INTRODUCTION 


The definition of a group of virus-coded proteins/polypeptides, the nonstruc- 
turals, as those not found in virions is crude, but has some merit in that it 
immediately focuses attention on the potential difficulties encountered in 
defining them. It leaves aside the practical difficulties implicit in attempting to 
detect small amounts of virus-coded polypeptides, particularly if they are of low 
molecular weight, which may be present in virions, and, on the other hand, in 
deciding whether or not such polypeptides, if detected, are present adven- 
titiously. This chapter will thus consider currently available information relat- 
ing to coronavirus polypeptides (or predicted polypeptides) other than those 
routinely present in the virions of all coronaviruses subjected to detailed anal- 
ysis, i.e., the spike glycoprotein (S), the integral membrane glycoprotein (M), the 
small-membrane protein (sM), and the nucleocapsid protein (N), or present in 
only some coronaviruses, Le, the hemagglutinin esterase (HE) glycoprotein. 
A brief consideration of the range of methods that have been employed to 
detect nonstructural proteins in virus-infected cells and the limitations of such 
methods is relevant to an analysis of available data on coronavirus nonstruc- 
tural proteins. Prior to the introduction of techniques for molecular cloning, 
only a limited number of approaches were available for detecting nonstructural 
polypeptides. The most obvious of these was the detection, using radiolabeling 
techniques, of products absent from virions but present in infected cells. The 
success of this type of approach is dependent in part on the efficiency of host 


T. D. K. BROWN AND I. BRIERLEY eœ Virology Division, Department of Pathology, Cambridge 
University, Cambridge CB2 1QP, England. 


The Coronaviridae, edited by Stuart G. Siddell, Plenum Press, New York, 1995. 


191 


192, T. D. K. BROWN anb I. BRIERLEY 


cell shutoff; this is frequently inefficient in coronavirus-infected cells. The in 
vitro translation of purified virus mRNAs provided a potential alternative 
approach to identification of virus-coded polypeptides not present in virions. 
Assay of enzymatic activity predicted to be associated with a nonstructural 
protein, e.g., virus-coded RNA-dependent RNA polymerase activity, is also 
possible. These approaches have been applied with varying success to corona- 
viruses. 

The prospects for analyzing nonstructural proteins were dramatically im- 
proved by the advent of molecular cloning techniques and consequent ease of 
sequence determination. The cloning and sequencing of coronavirus cDNAs 
has revealed substantial numbers of open reading frames {ORFs} in the unique 
regions of virus mRNAs known not to be involved in the synthesis of virion 
proteins. In most cases, no candidate product of these ORFs had previously been 
detected in infected cells. Cloned sequences can be employed to develop re- 
agents for the detection of putative nonstructural proteins coded by the ORFs; 
production of monospecific antisera is particularly valuable in this context. 

Detection is, however, only an initial, albeit essential, step in the charac- 
terization of nonstructural proteins. A range of functional studies are ulti- 
mately required for an understanding of the roles of these proteins; such studies 
have scarcely been started for coronaviruses. We are thus, at the present time, 
aware of the existence of a substantial number of ORFs potentially coding for 
nonstructural polypeptides without, in most cases, significant insights into 
their functions other than those which can be gained by more-or-less sophisti- 
cated computer-based sequence analyses. 
= The minimum function (RNA-dependent RNA synthesis] of the nonstruc- 
tural proteins encoded by mRNA 1 (genomic/virion RNA) is defined by its 
infectivity (Lomniczi and Kennedy, 1977; Schochetman et al., 1977]; this makes 
it unique among coronavirus nonstructural proteins. For this reason the prod- 
ucts encoded by mRNA 1s will be considered together for all coronaviruses; 
other ORFs for which no such unifying knowledge of function exists will be 
considered on the basis of individual viruses or groups of related coronaviruses. 


II. PRODUCTS OF mRNA 1 
A. Sequence Analysis of the Unique Regions of mRNA 1s 


1. General Analysis 


Four complete sequences of RNA 1 unique regions are currently available: 
those of avian infectious bronchitis virus (IBV) strain Beaudette, murine hepa- 
titis virus (MHV| strains JHM and A59, and human coronavirus (HCV) strain 
229E. Sequencing of the putative unique region of mRNA 1 of IBV represented 
in 17 overlapping cDNA clones (Boursnell et al., 1987) provided the first con- 
vincing evidence that this region of the coronavirus genome had an extremely 
large coding capacity (approximately 20 kb in the case of IBV] and that it 
contained two overlapping ORFs (1a and Ib) capable, in the case of IBV, of coding 
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for polypeptides of 441,000 and 300,000, Da respectively. The downstream 1b 
ORF was found to be in the —1 frame with respect to the upstream la ORF. 
There was a 42-nucleotide overlap between the ORFs. The implication was that 
coronavirus polymerases were likely to be complex and that their components 
might be expressed by an unusual mechanism. 

The unique region of RNA 1 of MHV strains JHM and A59 were subse- 
quently cloned and sequenced; they are approximately 22-kb long and again 
were found to contain two overlapping ORFs (la and 1b} (Baker et al., 1989; 
Bonilla et al., 1994; Bredenbeek et al. 1990b; Lee et al 1991; Pachuk et al., 1989; 
Soe et al., 1987). The relationship between the la and 1b ORFs is similar to that 
seen in IBV; the 1b ORF is in the —1 frame with respect to la and there is a 75- 
nucleotide overlap between them. Attention has been drawn to apparent minor 
errors in the published sequence of MHV-JHM and also to a region of genuine 
heterogeneity detected also in MHV-2 (Bonilla et al., 1994). 

The cloning and sequencing of the HCV 229E mRNA 1 unique region has 
also been reported (Herold et al., 1993). Again it was found to be large (approx- 
imately 20 kb) and to contain two overlapping reading frames with the 1b ORF 
being in the —1 frame with respect to the la ORF and overlapping it by 43 
nucleotides. A summary of the major features of the available mRNA 1 unique 
region sequences are presented in Table I. 

The overall relationships between the amino acid sequences of the pre- 
dicted products of the la and 1b ORFs have been determined by computer 
analysis (summarized in Table I). It is clear that the 1b ORF is much more highly 
conserved than the la ORF, and that within la the greatest divergence is seen at 
the amino-terminus. The substantial additional sequences present in MHV la, 
but not in IBV la, are found at the 5’ end (Bredenbeek et a1., 1990b; Herold et al., 
1993; Lee et al., 1991; Pachuk et al., 1989; Soe et al., 1987). The high degree of 
amino acid sequence conservation in lb is also indicated by the limited data 
available for porcine transmissible gastroenteritis virus (TGEV), feline infec- 
tious peritonitis virus (FIPV], and canine coronavirus (CCV). This high degree of 
conservation is in marked contrast to that observed for other coronavirus 
nonstructural proteins (see Sections III to X). 


TABLE I. Summary of the Features of Sequenced mRNA 1 Unique Regions? 





IBV Beaudette MHV-JHM | MHV A59 HCV 229E 





Length of la ORF (aa) 3951 4488 4468 4085 
Length of 1b ORF (aa) 2691 2731 2733 2686 
Length of 1a/1b fusion (aa)? 6629 7203 7176 6758 
la/1b Overlap (nt) 42 75 75 43 
la Amino acid identity to IBV (96) (100) 29 29 27 
la Amino acid similarity to IBV (96) (100) 52 52 51 
Ib Amino acid identity to IBV (%! (100) 56 57 54 
lb Amino acid similarity to IBV (96) (100) 72 72 70 


?Data obtained from published sequences using the GCG GAP program with default settings. 
bbased on simultaneous slippage model of Brierley et al. (1987, 1989, 1992). 
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2. Motif Analysis 


Computer-based motif analyses, in the absence of substantial efforts to 
obtain experimental data, have been extensively applied to the nucleotide 
sequences of the mRNA 1 unique regions and to the amino acid sequences 
derived from them. Initial analyses of the IBV sequence demonstrated amino 
acid homologies in the Ib ORF with RNA-dependent RNA polymerase-related 
polypeptides of Sindbis virus, an alphavirus, and brome mosaic virus, a tri- 
cornavirus; these lay between amino acids 1200 and 1500 (Boursnell et al., 1987). 
More sophisticated analyses of the la and Ib ORFs were subsequently carried 
out (Gorbalenya et al., 1988a,b, 1989b; Hodgman, 1988). The homologies de- 
tected by Boursnell et al. (1987) were confirmed and their relationships ex- 
tended using sequences from a wide range of RNA viruses. It was suggested that 
they included an nucleoside triphosphate (NTP} binding domain characteristic 
of helicases. 

Other motifs characteristic of positive-strand virus RNA-dependent RNA 
polymerases were also detected. The most “obvious” of these was the cryptic 
GDD motif present in the IBV sequence as SDD. A region with extremely high 
homology to potyvirus polymerases is noteworthy. Various other features were 
identified in the la and 1b ORFs. These included a potential zinc finger nucleic 
acid binding domain in 1b, a homologue of the 3C protease of picornaviruses in 
la, and 13 putative Q/G;S target sites distributed in both la and 1b. Homology to 
a fragment of the catalytic center of another class of cysteine protease from 
Streptococcus pneumoniae (SPL protease) was detected in la. Attention was 
also drawn to a possible growth factor-related domain and two potential mem- 
brane spanning regions in la. 

The availability of the IBV sequence and the analytical abilities of the 
Russian group laid the foundations for the dissection of other coronavirus 
mRNA 1 sequences as they became available. The possibilities for both narrow 
and broad phylogenetic comparisons are now more extensive and it is possible 
to consider the features of a core set of coronavirus mRNA 1 motifs, to which 
putative functions can be ascribed, and their relationships to both viral and 
cellular homologues. Those currently identified are: in la (1) a papainlike cys- 
teine protease domain(s}, (2) a chymotrypsin/picornaviral 3C proteaselike do- 
main, and (3) a cysteine-rich growth factor-related domain; and in lb (4) a 
classical positive-strand RNA-dependent RNA polymerase domain, (5) a zinc 
finger nucleic acid binding domain, and (6) an NTP binding/helicase domain. To 
this list it is perhaps reasonable to add the conserved membrane spanning 
domains. The locations of these motifs/domains are presented in Fig. 1 and 
Table II. 


a. Papainlike Cysteine Protease Domains 


Viral homologues of the classical papainlike proteases (PLP) were first 
identified in poty- (Oh and Carrington, 1989) and alphaviruses (Hardy and 
Strauss, 1989; Strauss et al., 1992). The identities of their putative active site 
cysteine and histidine residues were confirmed by site-directed mutagenesis. 
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NH, COOH MHV 
PLP1 X PLP2 3CLP GFL POL MB HEL 
NH, COOH HCV 
X PLP2 3CLP GFL POL MB HEL 
No COOH IBV 
SPL 
7 6 5 4 3 2 1 0 
a ee an sj 
kDa (x 10°5) 


FIGURE 1. The positions of motifs identified in the predicted polypeptide sequence of the la/b 
polyprotein encoded by the mRNA 1 unique regions of IBV, MHV, and HCV are shown. PLP - 
papain-like protease, 3CLP = 3C-like protease, GFL = growth factor-like, POL = RNA-dependent 
RNA polymerase, MB - metal binding, HEL - helicase. The data are from computer assisted 
analyses (Gorbalenya et al., 1989; Lee et al 1991; Herold et al., 1993). 


Homologues were subsequently identified in coronaviruses (Gorbalenya et al., 
1989b; Herold et al., 1993; Lee et al., 1991). Two PLP domains are present in both 
MHV and HCV, but only one is present in IBV. The IBV domain lies upstream of 
and overlaps the SPL protease domain. The role of the upstream PLP (PLPI) 
domain in the cleavage of p28 from the amino-terminus of MHV 1a has been 
demonstrated (Baker et al., 1989, 1993). It has been suggested that this PLP 


TABLE II. Positions of Motifs within the Predicted 


Amino Acid Sequences of mRNA la and b ORFs4 


IBV MHV-JHM HCV 

ORF la 

PLPI — 1100-1349 1041-1234 

PLP2. 1236-1497 1696-1953 1688-1886 

SPL 1385-1677 — — 

3CLP 2811-2943 3382-3517 2965—3268^ 

GFL 3894-3917 4446-4468 3933-40695 
ORF 1b 

POL 588-818 560-852, 534-836 

MB 931-1005 942-1015 924—999 

HEL 1209-1500 1217-1506 1202-1330 


aBased on data from Gorbalenya et al. (1988a,b, 1989a, b, 1991), 
Herold et al. (1993), Koonin [1991], and Lee et al. (1991). 

PBoundaries of predicted processed polypeptide-containing motif 
rather than those of motif itself (based on assumptions about tar- 
gets for the 3CLP). 
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domain is particularly closely related to the leader protease of foot-and-mouth 
disease virus (FMDV) (Gorbalenya et al 1991). There are no data available on 
the role of the downstream PLP domain (PLP2) of MHV. There are similarly no 
published data on the role of either of the PLP domains of HCV 229E or of the 
single PLP domain of IBV. The IBV PLP is homologous to the downstream PLP2 
domains of MHV and HCV. 

It is interesting that the arteriviruses, which contain a subset of the corona- 
virus polymerase sequences in their mRNA 1 unique regions, encode a PLP 
domain in a vestigial la ORF and that it cleaves la to generate a 30-kDA 
polypeptide. The cleavage is, however, downstream of the PLP domain (Snijder 
et al., 1992). Attention has also been drawn to a region highly conserved be- 
tween IBV and MHV, the so-called X domain, which is present immediately 
upstream of the IBV PLP domain but immediately downstream of the MHV 
PLP1 domain. A related domain has been detected close to the thiol protease 
domains of alpha- and rubiviruses (Gorbalenya et al., 1991). 


b. Chymotrypsin/Picornaviral 3C-like Protease Domains 


The chymotrypsin/picornaviral 3C-like protease (CLP) domains of approxi- 
mately 300 amino acids with homologies to proteases identified in picorna-, 
como-, nepo-, poty-, sobemo-, and luteoviruses are present in the 3’ region of the 
la ORFs of IBV, MHV, and HCV. The proposed catalytic histidine and cysteine 
residues initally identified in IBV are present in MHV and HCV (Herold et al., 
1993; Lee et al., 1991). Unusual features of the coronavirus 3CLPs include, first, 
the lack of conservation in MHV and HCV of what appeared to be a conserved, 
catalytically active acidic residue in IBV and other viral 3CLPs (Gorbalenya et 
al., 1989b), and, second, the substitution of coronavirus sequences of tyrosine 
for the conserved glycine normally seen in the putative 3CLP active site sub- 
strate binding region (Herold et al., 1993; Lee et al., 1991). Attempts have been 
made to predict cleavage sites for the putative 3CLP, based largely on studies of 
picornaviral enzymes; the cleavages occur primarily at Q,E/G,S,A but are selec- 
tive, and the basis for the selectivity is unclear. Gorbalenya et al. (1989) derived 
a consensus sequence for IBV including Q/S or G using approaches based on 
comparison of sequences surrounding the Q/S and Q/G sites flanking putative 
functional domains. Further potential cleavage sites conforming to the con- 
sensus were then identified; a total of 13 sites was reported. It was suggested 
that a significant feature of the consensus was the presence of a hydrophobic 
residue at —1, which is a general feature of 3CLP target sequences. The sites 
flanking the 3CLP domain both had a positively charged residue at —3. Conser- 
vation of at least some of these potential cleavage sites has been observed in 
MHV and HCV (Herold et al., 1993; Lee et al., 1991). The membrane domains 
flank the 3CLP domain. 


c. Cysteine-Rich Growth Factor-Related Domains 


This type of sequence is found in the predicted 1a polypeptide sequences of 
IBV, MHV, and HCV (Gorbalenya et al., 1989b; Herold et al., 1993; Lee et al., 
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1991). It is clear that it is exceptionally cysteine-rich, but its relationship to 
growth factors may be distant and evidence of functional relationships to 
growth factors is lacking. 


d. RNA-Dependent RNA Polymerase Domains 


The amino acid sequence relationships between positive-strand virus 
RNA-dependent RNA polymerases are well established (Argos, 1988; Koonin, 
199]). Notable features include the high degree of conservation of this region 
between IBV, MHV, and HCV, in particular the finding that the coronavirus 
sequences all contain an SDD motif rather than the classical GDD motif. No 
evidence for a second polymerase domain has been obtained. 


e. Zinc Finger Nucleic Acid-Binding Domain 


This domain, characterized by the presence of a defined series of histidine 
and cysteine residues, is again clearly conserved in the three coronavirus 1b 
sequences; but some of the structural features proposed on the basis of the IBV 
sequence cannot be present in the MHV or HCV polypeptides. Evidence for 
metal binding has yet to be obtained. 


f. NTP Binding/Helicase Domain 


The homology between this region of the coronavirus genome and se- 
quences present in other positive-strand RNA viruses was detected in initial 
analyses of the IBV sequence, but its significance was not appreciated until the 
work of Gorbalenya and Koonin (Gorbalenya et al., 1988a,b, 1989a,b}. Not 
surprisingly, the domain is well conserved in MHV and HCV. 


B. Expression of the la and 1b ORFs in Vitro 


Cell-free approaches have been used to analyze the translation of genomic 
RNA/mRNA 1. The translation of both genomic RNA preparations and in vitro 
transcripts prepared from cloned cDNAs has been studied. Virion RNA of MHV 
was translated in rabbit reticulocyte lysate (RRL) with the production of three 
structurally related polypeptides of molecular weight greater than 200 kDa 
(Leibowitz et al., 1982). A series of minor products of lower molecular weight 
could also be discerned. These observations have been extended in a number of 
ways (Denison and Perlman, 1986}. A series of products were detected in RRL 
translations with two quantitatively significant products with molecular 
weights greater than 200 kDa being emphasized (p220 and p250). A well-defined 
product of low molecular weight was characterized. It was demonstrated to be 
N-terminal and to be generated by proteolytic cleavage from a relatively large 
precursor. The precursor—product relationships of p28 and the discrete high- 
molecular-weight products was not however completely clarified. The inhibi- 
tory effects of leupeptin, a reversible inhibitor of trypsinlike serine/cysteine 
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proteases, and ZnCl, on p28 cleavage were demonstrated. Both these studies 
were carried out prior to the publication of the IBV polymerase sequence (Bours- 
nell et al., 1987), which revealed for the first time the potential complexity 
of the coronavirus polymerase "problem." 

The observations of Denison and Perlman (1986) were extended by Soe et 
al. (1987) using in vitro translation of in vitro transcripts prepared from the 
5'-most 2 kb of the MHV genome. They confirmed the N-terminal location of 
p28. They also demonstrated that deletion of sequences upstream of the poten- 
tial initation codon at position 215 increased the efficiency of translation in 
RRL of in vitro transcripts derived from the 2 kb fragment. It was suggested that 
the low levels of translation observed when the 5’ untranslated region (UTR) is 
intact result from the presence of highly stable RNA secondary structures. The 
work on MHV was extended (Baker et al., 1989) with the demonstration that 
sequences involved in the cleavage of p28 were located between 3.9 and 5.3 kb 
from the 5’ end of the MHV genome and the observation that the protease was 
active only in cis. Following the identification of two PLP domains in the MHV 
la sequence (Lee et al., 1991), a more detailed examination of the sequences 
responsible for the cleavage of p28 was carried out (Baker et al., 1993). The 
catalytic functions of the putative active site cysteine and histidine residues 
of PLP2 identified by computer-based analysis were confirmed by site-directed 
mutagenesis. Its activity was shown to be sensitive to the deletion of at 
least some of the polypeptide sequences between it and the N-terminal target 
site. 

In vitro translation of MHV genomic RNA has been used to study process- 
ing of the polypeptide sequences encoded by the 1b ORF (Denison et al., 1991). 
A series of polypeptides recognized by antisera raised against the carboxy- 
terminus of ORF lb were identified. Leupeptin, in contrast to its effect on p28 
cleavage, did not alter the pattern of cleavage observed for the 1b products, thus 
providing preliminary evidence for a role for at least two proteolytic activities 
in the cleavage of the MHV gene 1 polypeptides. It proved impossible, however, 
to align convincingly the cleavage products (apparent molecular weights, 90, 
74, 53, 44, and 32 kDa) with predicted 3CLP cleavage sites in 1b. 

An important question that arose following completion of the sequence of 
IBV was the mechanism by which ORF Ib is expressed, since it overlaps la by 42 
nucleotides and is in the —1 reading frame with respect to la. A number of 
observations raised the possibility that 1b may be expressed as a fusion protein 
with the upstream la ORF following a —1 ribosomal frameshift at the overlap 
region of the two ORFs (Boursnell et al., 1987). First, the 70 or so bases preceding 
the first AUG codon of 1b were found to have strong codon bias, similar to the 
bias found in other IBV genes, suggesting they had coding function. Second, no 
subgenomic mRNA with 1b as its 5’-proximal ORF had been detected in IBV- 
infected cells. Third, nucleotide sequence comparisons of the la/lb overlap 
region with the gag—pol overlap region of Rous sarcoma virus (RSV) revealed a 
short but significant homology (8 out of 9 bases conserved) in the region where 
frameshifting was suspected to occur in RSV (Jacks and Varmus, 1985). 

In order to test the possibility of ribosomal frameshifting in IBV, a region of 
cDNA corresponding to the la/1b overlap region was cloned within a reporter 
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gene and tested for frameshifting by in vitro transcription and translation 
(Brierley et al., 1987). It was confirmed that the la/1b region indeed specified a 
highly efficient —1 frameshift, with one in three ribosomes changing frame 
within the overlap region. This was the first nonretroviral example of the 
phenomenon. The IBV frameshift signal was also functional in vivo; synthetic 
transcripts containing the appropriate signals when injected into Xenopus oo- 
cytes produced the expected ratio of nonframeshifted to frameshifted products 
(Brierley et al., 1990). 

In further experiments, the signals for frameshifting were investigated 
using site-directed mutagenesis (Brierley et al., 1991, 1992). These signals were 
shown to be located within an 86-nucleotide stretch encompassing the la/1b 
overlap and were comprised of two main elements [see Fig. 2). The first was the 
sequence UUUAAAC, which was shown to be the actual site of the frame 
change; of the ribosomes that enter in the la frame (U-UUA-AAC}, some 30% 
leave in the 1b frame (UUU-AAA). This "slippery sequence” in itself was 
insufficient to evoke the change of frame, and additional information down- 
stream is required. This information is in the form of an RNA pseudoknot, an 
unusual kind of RNA structure composed of two base-paired regions stacked 
coaxially in a quasi-continuous manner and connected by two single-stranded 
loop regions (Pleij and Bosch, 1989]. The results indicated that no primary 
nucleotide sequence elements in the pseudoknot were required for the frame- 
shift process; as long as the overall shape and predicted stability of the structure 
was maintained, frameshifting was highly efficient. 

Frameshifting is thought to occur by simultaneous slippage into the —1 
reading frame of two ribosome-bound tRNAs present in the aminoacyl and 
peptidyl sites of the ribosome during decoding of the slippery sequence. The 
role of the pseudoknot in this process is uncertain; the most plausible explana- 
tion advanced so far is that the presence of the pseudoknot may slow or stall the 
ribosome as it translates the slippery sequence, allowing realignment of the 
decoding tRNAs on the mRNA in a new frame (Jacks et al., 1988). Consistent 
with this idea is that the RNA pseudoknot must be within 5 to 7 nucleotides of 
the slip site for efficient frameshifting to occur in the IBV signal (Brierley et al., 
1992), and additionally, that pausing at RNA pseudoknots can be demonstrated 
in vitro (Somogyi et al., 1993; Tu et al., 1992). 

It is clear from recent work that the frameshift expression strategy appears 
to be conserved among coronaviruses (see Fig. 2). Frameshift signals have been 
described for MHV strains A59 (Bredenbeek et al., 1990b) and JHM (Lee et al., 
1991) and also for HCV strain 229E (Herold et al., 1993; Herold and Siddell, 
1993). In each case, the slippery sequence UUUAAAC is used and is followed by 
an RNA pseudoknot. For IBV and MHV, the pseudoknots are similar in terms of 
the size of the stems and the loop lengths, although the stem 2 of MHV is 
predicted to be longer. The pseudoknot of HCV 229E, however, forms a more 
complex structure. Loop 2 of the HCV pseudoknot is 164 nucleotides in length, 
by far the longest loop 2 seen in those pseudoknots involved in frameshifting. 
Furthermore, the nucleotides that form the second arm of stem 2 of the pseudo- 
knot are themselves part of a hairpin loop that is predicted to form at the end of 
loop 2 (Herold and Siddell, 1993). This generates an additional stem, stem 3. It is 
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FIGURE 2. RNA pseudoknot structure associated with coronavirus frameshift sites. The italicized 
bases in the MHV A59 sequence are those that differ from those in the homologous IBV sequence 
(Boursnell et al., 1987; Bredenbeek et al., 1990). The frameshift signal of MHV JHM is identical to 
that of MHV A59 with exception of the indicated A to G change (Bredenbeek et al., 1990; Lee et al., 
1991). In the case of each frameshift signal, the la termination codon is indicated by white, 
individually boxed characters. The data for HCV 229E are from Herold et al. (1993). 


possible that the formation of this hairpin allows stem 2 to form more readily, 
compensating both for the extra length of loop 2 and the relative instability of 
the HCV stem 2, which has only five base pairs instead of the six base pairs seen 
for the other coronaviruses. 

The discovery of the Geen at the IBV frameshift site prompted a re- 
evaluation of the components of retrovirus frameshift signals. It became clear 
that many retroviral frameshift sites also had the potential to form RNA pseudo- 
knots (Brierley et al., 1989; ten Damm et al., 1990). Over the past few years, 
several of these signals have been investigated and the frameshift efficiency of 
the sites measured in vitro. It seems that the coronaviruses have the most 
efficient frameshift signals, with 30-40% of ribosomes changing frame. For 
retroviruses, the efficiency varies from a few percent up to 20%. Clearly, the 
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combination of the UUUAAAC sequence and a long, stable RNA pseudoknot 
in coronaviruses provides a very efficient frameshift signal. Why coronaviruses 
employ a frameshift strategy remains to be determined, but there are a number 
of possibilities. The most obvious is the likely requirement for production of a 
defined ratio of la:la/1b products. In retroviruses, frameshifting generates the 
gag-pro, gag-pol, or gag-pro-pol polyproteins from which essential replication 
enzymes are derived. Disruption of the stoichiometry of the gag:gag-pol ratio is 
likely to have drastic consequences. The effect of modulation of the frameshift 
efficiency on virus viability has been studied for a double-stranded RNA virus, 
L-A, of Saccharomyces cerevisae, which employs a frameshift strategy to pro- 
duce a gag-pol fusion protein. Experiments in which the ratio of gag:gag- 
pol proteins in cells was varied have indicated that virus viability would be 
greatly reduced if the ratio is even moderately distorted (Dinman and Wickner, 
1992). 

Frameshifting may also be a strategy to avoid packaging defective RNAs. In 
retroviruses, the packaging signal is located 3'-ward of the splice donor such 
that only unspliced RNAs are packaged. If the gag-pol fusion protein were 
produced from an RNA in which the gag termination codon was removed by a 
small splicing event, then this RNA would be packaged and would be nonviable 
(since it would not be able to produce the gag and pol products in the correct 
stoichiometric amounts]. Retroviruses avoid this problem by employing a 
frameshift strategy. The discovery that the MHV Ib ORF contains the genomic 
packaging signal (van der Most et al., 1991) provides a possible explanation for 
the absence of a subgenomic mRNA for Ib, since this RNA would compete with 
the full-length genome for packaging into virions. Coronaviruses may well 
produce 1b by frameshifting in order to avoid this predicament. 


C. Detection of the Products of mRNA 1 in Vivo 


Only limited success has been achieved in directly detecting coronavirus 
polymerase components in vivo. A single report has been presented in which a 
component was detected following radiolabeling of infected cells (Denison and 
Perlman, 1987). The p28 product, first detected in in vitro translation experi- 
ments with genomic RNA, was found in infected BALB/c 17CL-1 cells, but not 
in purified virions using two-dimensional gel electrophoresis. This small basic 
protein accumulated in the cells at late time. 

More recently, monospecific antisera raised against synthetic peptides or 
bacterial fusion proteins containing coronavirus polymerase sequences have 
been used to detect polymerase components in infected cells. Products from 
both the IBV and MHV mRNA 1 ORFs have been detected using Western 
blotting (Brierley et al., 1990; Denison et al., 1991; Zoltick et al., 1990). A series 
of specific bands was observed, but it has not been possible to study their 
precursor-product relationships using this approach. Denison et al. (1991) used 
immunoprecipitation of radiolabeled MHV-A59-infected cell lysates with anti- 
peptide or antifusion protein antisera to study the processing of the amino- 
terminus of the la polyprotein. The rapid appearance of p28 was demonstrated, 
but a complex spectrum of extremely high-molecular-weight products was de- 
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tected by antisera directed against sequences downstream of the p28 coding 
region. Pulse-chase experiments suggested that polypeptides with apparent 
molecular weights of 240 and 50 kDa might represent mature cleavage products 
from the polyprotein region encoded within the 5'-most 10 kb of MHV 1a. 

More success has been achieved in detecting and characterizing discrete 
products from the IBV ORF 1b. A 100-kDa polypeptide has been detected in 
infected cells by immunoprecipitation using an antibody raised against a region 
of the Ib ORF containing the SDD polymerase motif. Mapping and processing 
mechanisms have been studied using T7 vaccinia/transfection approaches. The 
approximate position of the carboxy-terminus has been mapped using deletion 
analysis and is consistent with generation of the 100-kDa polypeptide by cleav- 
age at a Q/S site thought to be a target for the viral 3CLP (Gorbalenya et al., 
1989). The position of the upstream cleavage site has not been determined 
experimentally, but the size of the processed product is consistent with a 
further 3CLP-mediated Q/S cleavage. The role of the 3CLP in processing has 
been confirmed by deletion of the 3CLP domain from constructs containing 
la/1b sequences; this resulted in appearance of a full-length translation product 
and disappearance of the processed 100-kDa product (Liu et al., 1994). 

The detection of RNA-dependent RNA polymerase activities in TGEV- and 
MHV-infected cells has been reported (Brayton et al., 1982, 1984; Dennis and 
Brian, 1982; Mahy et al.,1983) There is no direct evidence linking these activ- 
ities to the expression of the unique region of mRNA 1, but it is highly likely 
that they are a reflection of complexes formed between the range of polypep- 
tides generated by processing of the 1a and lab polyproteins. The activity is 
associated with membranes and evidence has been presented for several dis- 
tinct activities; an early activity involved in synthesizing full-length negative 
strands and a late activity which can be further subdivided into a replication 
fraction synthesizing full-length positive strands and a transcription fraction 
involved in production of both full-length and subgenomic mRNAs have been 
identified (Brayton et al., 1984). 


III. PUTATIVE NONSTRUCTURAL ORFs PRESENT IN THE 
SUBGENOMIC RNAs OF MHV 


A. mRNA 2 


1. The 2a ORF 


The MHV strains JHM and A59 differ in the expression of ORFs present in 
the genome between the polymerase and the spike genes. In the case of MHV- 
JHM two mRNAs 2 and 2-1 have been identified; RNA 2 contains the 2a ORF in 
its unique region, while RNA 2-1 contains the HE-encoding 2b ORF in its 
unique region. MHV A59, on the other hand, appears to produce only a single 
mRNA 2 that contains the 2a ORF and a defective 2b ORF in its unique region. 
The 2a ORF is considered to encode a nonstructural polypeptide. The 2a ORF 
sequences are highly related in both MHV-JHM and A59 [Luytjes et al., 1988; 
Shieh et al., 1989), potentially coding for, respectively, a 265 and a 261 amino 
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acid polypeptide. Computer analysis of the ORF has suggested the possible 
presence of a nucleotide binding site with the possible implication of a nucleic 
acid binding function (Luytjes et al., 1988). No homologue of the 2a ORF is 
found in IBV, TGEV, FIPV, or CCV. 


2. The 2a Product 


A 30- to 35-kDa product expressed from purified MHV RNA 2 was detected 
following in vitro translation of purified RNA (Leibowitz et al., 1982; Siddell, 
1983). Products of this size had previously been identified in MHV-infected cells 
(Bond et al., 1979; Siddell et al 1981). The expression of the 2a product in MHV- 
infected cells has been confirmed using antisera raised against bacterial fusion 
proteins containing parts of the putative 2a sequence (Bredenbeek et al., 1990a; 
Zoltick et al., 1990). A 30-kDa product was identified by both groups. It had a 
cytosolic localization (Zoltick et al., 1990). The pattern of its expression was 
similar to that of virus structural proteins, but it had a more rapid turnover than 
the nucleocapsid protein. It was not detected in purified virions (Bredenbeek et 
al., 1990a). The function of the 30-kDa product remains unknown, but it has 
been shown that strains of MHV-JHM deleted for 2a show normal growth 
properties in vitro. It is therefore possible that the 2a gene product is only of 
significance when the virus is growing in vivo (Schwarz et al., 1990). 


B. mRNA 4 


1. The 4 ORF 


The unique region of mRNA 4 contains an ORF potentially capable of 
translation to produce a 139-residue polypeptide (Skinner and Siddell, 1985). 
Analysis of the predicted amino acid sequence revealed a number of interesting 
features. It has a large hydrophobic amino-terminus and a basic carboxy- 
terminus. It has a strikingly high content of threonine residues outside the 
hydrophobic region. 


2. The 4 Product 


Early in vitro translation studies failed to conclusively assign a polypeptide 
product of mRNA 4. A product of the size predicted to result from the transla- 
tion of the RNA has been detected in infected cells (Siddell et a1., 1981) and in in 
vitro translation products of RNA fractions enriched for mRNAs 4 and 5 (Sid- 
dell, 1983). The similarity in size of the mRNA 4 ORF and ORFs found in the 
unique region of mRNA 5 made a coding assignment unsatisfactory on the basis 
of these data. The difficulty was resolved by the production of a specific anti- 
serum against a bacterial fusion protein consisting of B-galactosidase fused to 
the predicted carboxy-terminus of the mRNA 4 polypeptide (Ebner et al., 1988). 
The use of this antiserum in immunoprecipitation studies confirmed the ex- 
pression of the predicted polypeptide in infected cells that was detected as a 15- 
kDa product. Its correlation with the previously observed in vitro translation 
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product of approximately 15 kDa was also confirmed by immunoprecipitation. 
Immunofluorescence studies demonstrated a cytoplasmic location. The possi- 
bility that it was as major virion component was excluded; the antiserum 
possessed no neutralizing activity. The function of the 15-kDa product remains 
unknown. The mutation of the gene 4 transcriptional initiation signal in the 
MHV S strain, which prevents synthesis of the gene 4 mRNA, has no effect on 
growth in DBT cells (Yokomori and Lai, 1991). 


C. mRNA 5 


Sequencing of the unique region of mRNA 5 of MHV-JHM and MHV-A59 
revealed the presence of two ORFs, 5a and b (Budzilowicz and Weiss, 1987; 
Skinner et al 1985). No mRNA that might allow expression of the downstream 
ORF by a conventional scanning mechanism has been observed; no transcrip- 
tional signals were obvious in the sequence immediately upstream of the 
5b ORF. 


1. The 5a ORF 


This is 321 bases in length in MHV-JHM and 336 bases in length in MHV- 
A59; it potentially encodes polypeptides of molecular weights 12.4 kDa (107 
residues) and 13 kDa (112 residues), respectively. The predicted polypeptide is 
basic. 


2. The 5a Product 


The putative product of the 5a ORF has only been detected by in vitro 
translation of synthetic mRNA (Budzilowicz and Weiss, 1987). Deletion of 
almost the whole of the 5a ORF in the MHV S strain has no effect on its growth 
in DBT cells (Yokomori and Lai, 1991]. 


3. The 5b ORF 


This is 264 bases in length in MHV-JHM and 249 bases in length in MHV- 
A59; it potentially encodes polypeptides of molecular weights 10.2 kDa (88 
residues) and 9.6 kDa (83 residues), respectively. The predicted polypeptide has 
a strongly hydrophobic region toward the amino-terminus. 


4. The 5b Product 


Recently, it has been shown that the ORF 5b product is, in fact, the sM 
polypeptide of MHV (Yu et al., 1994). Previously, a 9- to 10-kDa product was 
detected in MHV-infected cells by Skinner et al. (1985) and was proposed as a 
candidate for the product of the 5b ORF. Its synthesis in in vitro translation 
reactions was correlated to the abundance of mRNA 5 in the RNA preparations. 
The potential for expression of this ORF was confirmed by in vitro translation 
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of synthetic mRNAs produced by T7 polymerase transcription of cloned cDNA 
sequences (Budzilowicz and Weiss, 1987). It is interesting to note that expres- 
sion of the downstream 5b ORF from RNAs containing both ORFs was substan- 
tially more efficient than that of the upstream 5a ORF. However, the RNA 
species analyzed did not have the structure of the natural mRNA 5. The mecha- 
nism of the selectivity was not clarified. 

The finding of a suboptimal initiator AUG context for the 5a ORF and the 
absence of internal methionine residues in its predicted sequence coupled with 
codon usage data had suggested that both ORFs might be expressed, possibly by 
a leaky scanning mechanism, but recent work using synthetic mRNAs has 
demonstrated that the translation of 5b is likely to involve internal entry of 
ribosomes (Thiel and Siddell, 1994). The expression of the 5b ORF in infected 
cells has been demonstrated using a specific antiserum raised against a 
recA/5b/g-galactosidase fusion protein expressed in E. coli (Leibowitz et al., 
1988). The antiserum detected the 5b product in infected cells using both 
peroxidase labeling of fixed cells and immunoprecipitation of infected cell 
extracts. The observed size was consistent with that of the 9- to 10-kDa product 
previously detected in infected cells by Skinner et al. (1985). 


IV. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs PRESENT 
IN THE SUBGENOMIC RNAs OF BCV 


A. mRNA 2-1 


The close sequence relationship of BCV to MHV is reflected in part by the 
pattern of subgenomic RNAs detected in BCV infected cells. mRNA 2 has a 
unique region potentially coding for a nonstructural polypeptide and a mRNA 
2-1 coding for the HE protein as in MHV-JHM (Keck et al., 1988). 


l. The 2a ORF 


The sequence of the single ORF present in the unique region of mRNA 2 
has been determined (Cox et al., 1989). The predicted polypeptide has a size of 
32 kDa. It shares nucleotide (65%) and amino acid (45%) homology with the 
predicted product of the MHV 2a ORF. An antiserum raised against a B-galac- 
tosidase fusion protein has been used to detect expression of the 2a polypeptide 
(ns2} in infected cells. It has been demonstrated to be a nonstructural phospho- 
protein that accumulates in the cytoplasm during infection. The phosphoryla- 
tion occurs on serine and threonine residues (Cox et al., 1991). 


B. mRNA 4 


The unique region of mRNA 4 as defined by consensus intergenic homol- 
ogy regions contains two small ORFs (Abraham et al., 1990). An RNA species 
likely to correspond to this unique region has been detected (Keck et al., 1988). 
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The MHV RNA 4 codes for a single larger polypeptide; the possibility of an 
artifactual origin for the two BCV ORFs has been eliminated by sequencing of 
independently cloned cDNAs. It is suggested that the two ORFs have arisen as a 
result of a one-base deletion in the middle of a single ancestral ORF (Abraham et 
al., 1990). The addition of a single base gives a single ORF of 11 kDa with 
substantial homology to the 15-kDa product of MHV ORF 4. 


1. The 4a ORF 


This predicts a polypeptide of 4.9 kDa. 


2. The 4b ORF 


This predicts a polypeptide of 4.8 kDa. 


C. mRNAs 5 and 5-1 


Initial analyses of the subgenomic RNAs found in BCV-infected cells sug- 
gested a single mRNA between mRNA 4 and mRNA 6 (Keck et al., 1988). 
Sequence analysis of the putative unique region of this RNA suggested that an 
additional consensus signal for transcriptional initiation is present in this 
region and a reexamination of the subgenomic RNAs corresponding to the 
region revealed the presence of an additional species, 5-1 (Abraham et al., 1990). 
Thus, again, there is a contrast with the situation found in the related virus 
MHV where a single RNA species contains both the 5a and 5b ORFs in its 
unique region and the downstream ORF appears to be efficiently expressed 
(Budzilowicz and Weiss, 1987; Leibowitz et al., 1988; Skinner et al., 1985). The 
likely role of the 5-1 RNA in the expression of the homologue of MHV 5b is 
suggested by the observation that translation of in vitro transcripts containing 
both the 5 and 5-1 ORFs does not result in efficient synthesis of the downstream 
5-1 product (Abraham et al 1990). It is, however, efficiently synthesized when 
in the upstream position. 


1. The 5 ORF 


This predicts a 109-amino acid, 12.7-kDa polypeptide. It has approximately 
50% amino acid homology with the 12.4- and 13-kDa polypeptides of MHV- 
JHM and MHV-A59, respectively (Budzilowicz and Weiss, 1987; Skinner et al., 
1985). 


2. The 5-1 ORF 


This predicts an 84-amino acid, 9.5-kDa polypeptide. This has greater than 
60% amino acid homology with the 10.2- and 9.6-kDa polypeptides of MHV- 
JHM and MHV-A59, respectively (Budzilowicz and Weiss, 1987; Skinner et al., 
1985]. 
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3. The 5-1 Product 


The synthesis of this product in infected cells has been demonstrated using 
immunofluorescence. The specific antiserum used was that prepared against 
the MHV 5b product (Leibowitz et al., 1988). It is expressed both internally and 
on the surface of infected cells. This would be consistent with the presence of a 
potentially membrane-spanning hydrophobic region. As in the case of MHV, 
the 5b polypeptide may, in fact, be the sM polypeptide of BCV [Godet et al., 
1992). 


V. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs PRESENT 
IN THE SUBGENOMIC RNAs OF HCV-OC43 


HCV-OC43 and BCV show remarkable similarity in terms of nucleotide 
sequence and immunological cross-reactivity of their structural proteins, im- 
plying a close "evolutionary" relationship. The region between the S and M 
proteins predicted on the basis of this relationship to encode nonstructural 
proteins has been cloned and sequenced (Mounir and Talbot, 1993). As expected 
it contains ORFs with high homology to those encoding nonstructural polypep- 
tides in the corresponding region of BCV, but only two ORFs were identified. 
They appear to be expressed from separate mRNAs (5 and 5-1) as are their 
counterparts in BCV. The ORFs expected to be expressed from mRNA 4 are 
absent, but, surprisingly, the mRNA is still expressed. 


A. The 5 ORF 


This predicts a 109 amino acid 12.7-kDa polypeptide. It has 96.396 amino 
acid identity with its BCV homologue. 


B. The 5-1 ORF 


This predicts an 84-amino acid, 9.5-kDa polypeptide. This has 96.496 iden- 
tity with its BCV counterpart. This may be the sM polypeptide of HCV OC43. 


VI. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs PRESENT 
IN THE SUBGENOMIC RNAs OF TGEV AND PRCV 


Two large ORFs encoding putative nonstructural polypeptides have consis- 
tently been identified between the spike and small membrane genes of TGEV, 
but there have been differing reports of size and putative mechanisms of expres- 
sion (Britton et al., 1989; Kapke et al. 1989; Rasschaert et al., 1987; Wesley et al., 
1989]. It is possible that in some strains of virus they are expressed from two 
mRNAs, whereas in others only one involved. In one case the existence of the 
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two RNA species has been demonstrated by S1 mapping (Wesley et al., 1989). 
However, for simplicity, the ORFs and their products will be described within 
both frameworks; the putative mRNAs will be referred to as 3 (3-1) and the ORFs 
as 3 (3a) and 3-1 (3b). An extreme 3’ ORF and corresponding mRNA (7) has been 
also been identified (Britton et al 1989; de Groot et al 1988; Jacobs et al 1986). 


A. mRNA 3 


1. The 3 (3a) ORF 


Two groups have sequenced the region containing this ORF using cDNA 
clones derived from the tissue culture-adapted Purdue strain of TGEV (Kapke et 
al., 1989; Rasschaert et al., 1987). The predicted polypeptides are identical, 
being 7.7 kDa in size. Two further sequences from virulent isolates, the 
FS772/70 strain (Britton et al., 1989) and the Miller strain (Wesley et al., 1989) 
have been determined. The predicted polypeptides differ in these cases, being 
6.6 kDa and 7.9 kDa, respectively, in size. All groups have identified a putative 
upstream transcription reinitiation site thought to be involved in synthesis of 
mRNA 3. Its function has been confirmed using S1 mapping in one case (Wesley 
et al., 1989). The sequences of these predicted polypeptides have been analyzed 
using various computer-based techniques, but no characteristics suggestive of 
function have been identified. 

The putative initiation site for transcription mRNA 3 and part of the 3a 
ORF is missing from the porcine respiratory coronavirus (PRCV) sequence. It is 
unlikely, therefore, that the 3a ORF is expressed in this "variant" of TGEV (Page 
et al., 1991). 


2. The 3-1 (3b) ORF 


The region containing this ORF has again been sequenced by four groups 
using the Purdue strain and the two virulent strains, FS772/70 and Miller 
(Britton et al., 1989; Kapke et al., 1989; Rasschaert et al., 1987; Wesley et al., 
1989). A striking discrepancy between the two Purdue sequences has been 
noted. The sequence of Rasschaert et al. (1987) predicted a polypeptide of 18.8 
kDa; the subsequent work of Kapke et al. (1988) suggested that there may have 
been a sequencing error in the initial study leading to a truncation of the 
predicted 3-1 ORF. The data of Kapke et al. (1988) are consistent with the 
observation of a 24-kDa in vitro translation product from mRNA 3-enriched 
preparations (Jacobs et al., 1986] and the subsequent virulent strain ORF size 
predictions of 27.6 kDa (FS772/70) (Britton et al., 1989) and 27.7 kDa (Miller) 
(Wesley et al., 1989). The most notable feature of this predicted ORF is the 
hydrophobicity of its N-terminus and the clustering of charged residues at the 
carboxy-terminus. 

In PRCV, the 3 (3b) ORF is likely to expressed from a 5’ truncated version of 
mRNA 3 transcribed from a novel initiation site immediately upstream of ORF 
3 (Page et al., 1991). 
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1. The 7 ORF 


The sequencing of the extreme 3’ end of the TGEV genome has revealed the 
presence of a small ORF that may be expressed from a 0.7-kb mRNA (Britton et 
al., 1989; Kapke and Brian, 1986; Rasschaert et al 1987). The 9-kDa polypeptide 
predicted by the ORF has the characteristics of a membrane polypeptide; it 
contains hydrophobic sequences at the N- and C-termini separated by a central 
hydrophilic region. 


VII. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs 
PRESENT IN THE SUBGENOMIC RNAs OF FIPV AND FECV 


Two mRNAs [other than the genomic RNA, RNA 1) that may encode 
nonstructural polypeptides have been identified in FIPV-infected cells; these are 
a 5.2-kb (RNA 3} and a 1.6-kb (RNA 6) species (de Groot et al., 1987). Sequence 
data are only available for the unique region of RNA 6 (de Groot et al., 1988). 


A. mRNA 6 


1. The 6a ORF 


In the genomic sequence, this ORF overlaps with the 3’ end of the nucle- 
ocapsid ORF. S1 nuclease analysis, however, has demonstrated that only se- 
quences downstream of the nucleocapsid ORF are contained in mRNA 6 (de 
Groot et al., 1988). This is consistent with the finding of a consensus transcrip- 
tion initiation signal at the mRNA body junction detected by S1 mapping. On 
this basis the 6a ORF predicts an 11-kDa polypeptide. The 11-kDa polypeptide 
predicted by ORF 6a of FIPV is strongly homologous to the 9-kDa polypeptide 
predicted by the 7 ORF of TGEV. ORF 6a of FIPV is 69 nucleotides longer than 
ORF 7 of TGEV. The insertion extends the hydrophilic central part of the 
predicted polypeptide. The feline enteric coronavirus (FECV) 6a polypeptide is 
99% identical (one amino acid difference] with that of FIPV (Vennema et al., 
1992). 


2. The 6b ORF 


This ORF predicts a 22-kDa polypeptide that is not homologous to any 
TGEV polypeptide. It is hydrophilic with a short hydrophobic section at the 
N-terminus. There is no consensus transcription initiation signal appropriate 
for expression of this ORF from its own mRNA and no RNA smaller than the 
1.6-kb species containing both 6a and 6b has been detected in infected cells (de 
Groot et al., 1988) The putative methionine initiation codon for the 6b ORF is in 
an optimal context. The 6a ORF contains two internal methionine residues in 
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addition to the initiator methionine, but these are in poor contexts. This 
situation, however, does contrast with that found in other cases where leaky 
scanning expression of downstream coronavirus ORFs has been invoked; in these 
cases, internal methionines are absent from the upstream ORFs (Boursnell et 
al., 1985; Skinner et al., 1985). The FECV 6b ORF shows a carboxy-terminal 
deletion when compared to that of FIPV; the amino-terminal 123 residues show 
89% identity to the equivalent FIPV sequence (Vennema et al., 1992). 


VIII. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs 
PRESENT IN THE SUBGENOMIC RNAs OF CCV 


Of a CCV sequence, 9.6 kb extending from the region encoding the carboxy- 
terminus of ORF Ib to the 3' end of the genome is available for strain Insavc-1 
(Horsburgh et al., 1992). Sequences from the extreme 3’ end of the genomes of 
strains K378 and I-71 are also available (Vennema et al., 1992). Two mRNAs (3 
and 7) each encoding two putative nonstructural polypeptides have been identified. 


A. mRNA 3 


1. The 3a ORF 


This encodes a 8.6-kDa polypeptide that shows 83.5% identity to the 
TGEV 3a polypeptide (Horsburgh et al., 1992). 


2. The 3b ORF 


This encodes a 4.0-kDa polypeptide. This modest coding potential is the 
consequence of a termination codon present within a “potentially larger" ORF 
(28.4 kDa} that has 92.4% identity with the equivalent TGEV ORF (Horsburgh 
et al., 1992). In spite of the presence of a potential transcriptional initiator 
sequence immediately upstream of the 3b ORE, no evidence for the synthesis of 
a corresponding mRNA has been obtained (Horsburgh et al., 1992). 


IX. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs PRESENT 
IN THE SUBGENOMIC RNAs OF HCV 229E 


The unique regions of mRNA 4 and 5 of HVC 229E have been sequenced 
(Jouvenne et a1.,1992; Raabe et al., 1990; Raabe and Siddell, 1989). 


A. mRNA 4 


1. The 4a ORF 


This ORF predicts a polypeptide of 15.3 kDa that lacks internal methi- 
onines (Raabe et al., 1990; Raabe and Siddell, 1989). 
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2. The 4b ORF 


This ORF predicts a polypeptide of 10.2 kDa that is characterized by a 
hydrophobic amino-terminus (Raabe et al., 1990; Raabe and Siddell, 1989). 
cDNAs encoding deleted versions of ORFs 4a and 4b have been cloned and 
sequenced; the predicted polypeptide products are 4.7 kDa and 9.6 kDa, respec- 
tively. It seems likely that the cDNAs were derived from a minor mRNA 
species (Jouvenne et al., 1992). 


B. mRNA 5 


1. The 5 ORF 


This ORF predicts a hydrophobic polypeptide of 9.1 kDa that is homolo- 
gous to the corresponding sM polypeptide of TGEV (Godet et al., 1992). 


X. PUTATIVE NONSTRUCTURAL POLYPEPTIDE ORFs PRESENT 
IN THE SUBGENOMIC RNAs OF IBV 


A. mRNA 3 


Nucleotide sequences for the unique region region of IBV mRNA 3 have 
been reported for three virus strains: Beaudette, M41, and KB8523 (Boursnell et 
al., 1985; Niesters et al., 1986; Sutou et al., 1988). The predicted amino se- 
quences of the three ORFs detected are conserved in all strains. The absence of 
internal methionines and the poor contexts of the initiator methionines of the 
upstream ORFs initially focused attention on the possibility of downstream 
ORF expression by leaky scanning, but it has been subsequently demonstrated 
that expression of the 3c ORF involves internal ribosome entry (Liu and Inglis, 
1992). There is no evidence for individual mRNAs for the three ORFs. 


1. The 3a ORF 


The polypeptide predicted to be translated from this ORF has a molecular 
weight of 6.7 kDa. It is neutral and hydrophobic. It has been detected in infected 
cells using antifusion protein antisera (Liu et al., 1991). 


2. The 3b ORF 


The polypeptide predicted by this ORF has a molecular weight of 7.4 kDa. 
It is highly acidic. It has been detected in infected cells using antifusion protein 
antisera (Liu et al., 1991). 


3. The 3c ORF 


The polypeptide predicted by this ORF has molecular weight of 12.4 kDa. It 
has a hydrophobic domain near the N-terminus, potentially capable of spanning 
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cellular membranes, and a hydrophilic C-terminal domain. The hydrophobic 
domain is flanked by hydrophilic residues. Codon usage in the 12.4-kDa ORF 
conforms to that of the IBV structural proteins; the codon usage of the 6.7- and 
7.4-kDa ORFs does not. 


4. The 3c Product 


The expression of the 3c polypeptide product in IBV-infected cells has been 
demonstrated using a specific antiserum raised against a bacterial fusion pro- 
tein consisting of B-galactosidase and the complete 3c coding region (Smith et 
al., 1990). A 12.4-kDa product was detected in infected cells by immuno- 
precipitation. It was largely located in the large-membrane subcellular fraction 
of infected cells along with the spike and membrane glycoproteins; nucleocap- 
sid was, on the other hand, found largely in the microsomal fraction. This is 
consistent with a membrane location for the 3c product. Immunofluorescence 
studies demonstrated that the polypeptide was located internally and on the 
cell surface; there was some indication of the internal fraction of 3c being 
located in the Golgi and that the 3c polypeptide is present in virions (Liu and 
Inglis, 1991). It is clear that it is homologous to the sM polypeptide of TGEV. 


B. mRNA 5 


The nucleotide sequence of the unique region of IBV mRNA 5 has been 
determined for the Beaudette strain (Boursnell and Brown, 1984). Two ORFs 
were identified. No evidence for an mRNA capable of expressing the down- 
stream ORF independently has been obtained. Neither predicted polypeptide 
contains an internal methionine. 


1. The 5a ORF 


The predicted polypeptide has a molecular weight of 7.5 kDa. The polypep- 
tide is hydrophobic, having a very high leucine content (17 of 65 residues]. 


2. The 5b ORF 
The predicted polypeptide has a molecular weight of 9.5 kDa. 


Xl. CONCLUSION 


All coronaviruses appear to encode a substantial number of nonstructural 
polypeptides; this is consistent with their genetic complexity. The ORFs pres- 
ent in the unique regions of the genomic RNA (mRNA 1) from IBV, MHV, and 
HCV show substantial homology in amino acid sequence and in their organiza- 
tion. The smaller putative nonstructural ORFs are, however, more hetero- 
geneous in predicted sequence, genomic location, and putative mode of expres- 
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sion even within clusters of related viruses. This plasticity is particularly 
striking in the case of polypeptides encoded between S and M. Expression in 
infected cells of many of the ORFs detected by sequencing has not yet been 
demonstrated and there is little or no insight into their functions. It will 
difficult to determine these functions in the absence of systems, for example, 
for genetic analysis of in vitro replication/transcription and for the production 
of site-directed mutations in the viral genome. 
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CHAPTER 11 


The Molecular Biology 
of Toroviruses 


Eric J. SNIDER AND MARIAN C. HORZINEK 


I. INTRODUCTION 


Recently, the International Committee on the Taxonomy of Viruses has in- 
cluded the toroviruses as a new genus in the previously monogeneric family of 
the Coronaviridae (Pringle, 1992, Cavanagh et al., 1994). Toroviruses and corona- 
viruses share some similarities in their structure and biology (see Chapter 18, 
this volume]; however, it was the molecular characterization of the torovirus 
prototype, Berne virus (BEV), that produced the evidence for an evolutionary 
link between the two virus groups. 

After the recognition of the link between toroviruses and coronaviruses, 
the genome organization and replication strategy of a third virus group, the 
arteriviruses, were found to be strikingly similar to both genera of the Corona- 
viridae (Den Boon et al., 1991b). However, it is evident that the arteriviruses are 
more distantly related than toroviruses and coronaviruses, and the inclusion of 
the arteriviruses as a third genus in the Coronaviridae was deemed inappropri- 
ate. Nevertheless, to indicate the ties between coronaviruses, toroviruses, and 
arteriviruses the unofficial term "coronaviruslike superfamily" has been put 
forward (Den Boon et al., 1991b). The similarities and differences between the 
members of this superfamily will be discussed elsewhere (Chapter 12, this 
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volume). In this chapter, we will focus on the molecular analysis of the 
torovirus particle, genome, mRNAs, and structural and nonstructural proteins. 


Il. THE TOROVIRION 


The equine torovirus BEV was isolated from a diarrheic horse in 1972 at the 
University of Berne, Switzerland. However, the virus was not studied in detail 
until particles with the same morphology were observed in the feces of diar- 
rheic cattle in Breda, Iowa, in 1979 (Weiss et al., 1983; Woode et al., 1982). 
Similar pleiomorphic viruses were also found in human feces collected in 
Birmingham, UK, and Bordeaux, France (Beards et al., 1984, 1986) and, more 
recently, in Toronto, Canada (Tellier and Petric, 1993). 

To date, BEV is the only torovirus that can be propagated in cultured cells. 
As a result, it is the best-studied member of the genus. The unique structure of 
the torovirus particle initially led to the proposal of a new family of enveloped 
animal viruses, the Toroviridae (Horzinek and Weiss, 1984; Horzinek et al., 
1987). As illustrated in Figs. 1 and 2, toroviruses are enveloped pleiomorphic 
particles measuring 120 to 140 nm in their largest axis. Spherical, oval, elon- 
gated, and kidney-shaped viruses are observed (Weiss et al., 1983). Drumstick- 
shaped projections, very similar to the spikes of coronaviruses, decorate the 
virion surface. However, as illustrated in Fig. 2A, the nucleocapsid structure of 
the toroviruses is very different from that of coronaviruses: the core of the 
torovirion is formed by an electron-dense, tubular structure, which may be 
straight or may display the typical toro-shape, that is prevalent in mature 
virions. The straight form is observed especially prior to the process of budding, 
which takes place at intracellular membranes, predominantly those of the 
Golgi system (Weiss and Horzinek, 1986; Fagerland et al., 1986). The current 
structural model of BEV, based on electron micrographs and the molecular data 
described below, is shown in Fig. 2B. 





FIGURE 1. Negative staining of Berne virus particles. Magnification: approximately 150,000x 
(Weiss et al., 1983). 
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II. THE TOROVIRUS GENOME 


The BEV genome is a single molecule of infectious, polyadenylated RNA 
(Snijder et al., 1988) with an estimated size of 25-30 kb. This size estimate is 
based on the sequence analysis of a substantial part of the genome (about 16 kb] 
and the parallels between the genome organization of BEV and other members 
of the Coronaviridae. 

The BEV genome probably contains six open reading frames {ORFs} (Fig. 3). 
As for all other coronaviruses, the two most 5’ reading frames (ORF la and 1b) 
are translated from the genomic RNA and constitute the viral replicase gene. 
The four remaining reading frames, of which ORFs 2, 3, and 5 have been 
identified as structural genes, are expressed by generation of a 3'-coterminal 
nested set of mRNAs (see Section IV). 

The two parts of the BEV genome that have been sequenced are separated 
by a gap in the ORF la region. About 1.5 kb of the 5’ end sequence of the BEV 
genome was determined from cDNA clones derived from genomic RNA and 
defective interfering (DI) RNAs (Snijder et al., 1991b). A sequence of about 14.5 
kb, starting 1 kb upstream of the ORF la/ORF 1b ribosomal frameshift site and 
ending at the 3’ poly(A) tail, was obtained by analysis of a cDNA library 
prepared from poly(A]-containing RNA from BEV-infected cells (Snijder et al., 
1990a,c). 

Assuming that the 5’ ends of the BEV genome and BEV DI RNAs are 
colinear, the ORF la initiation codon is located at nucleotide (nt) position 825- 
827 in the genome. The region upstream of ORF la is probably untranslated 
since it contains only one other AUG codon (nt 22-24) that is followed by a 
UGA termination codon three triplets downstream (Snijder et al., 1991b). 

The 3’ nontranslated region (NTR) of the BEV genome encompasses 200 nt, 
excluding the poly(A) tail (Snijder et al., 1989). Interestingly, a computer anal- 
ysis of this region reveals a large potential stem-loop structure that may be one 
of the signals for viral RNA replication (Snijder et al., 1989). The 3' NTR is the 
only region of the genome for which the sequence of a second torovirus, the 
bovine Breda virus (BRV), is available. A short 269-nt sequence upstream of the 
BRV poly(A) tail was obtained from the analysis of a cDNA library prepared 
from genomic RNA purified from the feces of an infected animal (Koopmans et 
al., 1991). This sequence, comprising the 3’ end of the BRV nucleocapsid (NI 
protein gene and the 3’ NTR, is 93% identical to the BEV sequence. Hybridiza- 
tion experiments suggest that there is a high degree of sequence similarity 
between equine and bovine toroviruses, with the possible exception of the bi 
part of the spike (S) protein gene (Koopmans et al., 1991). 


IV. BERNE VIRUS TRANSCRIPTION AND TRANSLATION 


A 3'-coterminal nested set of 5 mRNAs (including the genome) is generated 
during BEV replication (Snijder et al., 1990c) (Fig. 3). The estimated sizes of 
these RNAs and their relative abundance late in infection (Snijder et al., 1988) 
are listed in Table I. 





ss + RNA 


25-30 kb 
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The analysis of BEV subgenomic (sg) RNA synthesis by UV transcription 
mapping (Snijder et al., 1990c} indicates that (late in infection) the BEV mRNAs 
are transcribed independently, i.e., from separate transcription units. In com- 
mon with other coronaviruses, the BEV genomic sequence contains conserved 
intergenic sequences that are probably involved in mRNA transcription. For 
some coronaviruses, these intergenic motifs [e.g., 5’ AAUCuAuAC 3’ for mu- 
rine hepatitis virus (MHV |] have been identified as the site where the common 
5' leader sequence is fused to the mRNA bodies (Spaan et al., 1988; Lai, 1990). 
Although the torovirus genome contains conserved AU-rich intergenic se- 
quences (5' uaUcUUUAGa 3’) (Fig. 4], there appears to be an important differ- 
ence, since in the case of torovirus RNAs no evidence for fusion of a common 
leader to mRNA bodies at this position has been obtained. Thus, none of the 
clones from a cDNA library prepared from poly(A)-selected intracellular RNA 
contained a candidate leader sequence. In addition, primer extension analyses 
of RNA 5 and DI RNAs indicated that BEV mRNAs terminate at, or just 
upstream of, the conserved intergenic sequence (Snijder et al., 1990c, 1991b). In 
terms of transcription, however, the consequences of this dissimilarity be- 
tween toro- and coronaviruses could be limited: direct binding of the poly- 
merase to the various BEV "core promoters" on a negative-stranded template 
could simply replace a leader-priming mechanism. 

In addition to the products of ORFs 2, 3, 4, and 5, which are assumed to be 
synthesized by monocistronic translation of a nested set of structurally poly- 
cistronic mRNAs, the 3’ part of the BEV genome may encode one more protein: 
ORF 5 completely overlaps with a 264-nt ORF that potentially encodes a hydro- 
phobic 10-kDa protein. Although no such protein has been observed in virions 
or BEV-infected cells, it is remarkable that a similar situation, a small hydro- 
phobic protein expressed from an ORF that completely overlaps with the N pro- 
tein gene, has been reported for the bovine coronavirus (BCV) (Senanayake et 
al., 1992). 

BEV ORF lb is expressed from the genomic RNA by a —1 ribosomal frame- 
shift (Snijder et al., 1990a), which apparently is one of the hallmarks of the 
coronaviruslike supergroup. As a result, two proteins are produced from the 
replicase region: the ORF la protein and an ORF 1a/1b fusion protein. BEV ORF 
la ends with the nucleotide sequence 5’ U UUA AAC UGU UGA 3’, in which 
the UGA codon terminates ORF la translation. The heptanucleotide sequence 
5' U UUA AAC 3’ is one of the typical “shifty sequences," which have been 
described to be present at ribosomal frameshift sites of an increasing number of 
viruses (Jacks et al., 1988; Brierley et al., 1989; Ten Dam et al., 1990). Two 
versions of the downstream RNA pseudoknot (PK), which usually accompanies 
a shifty heptanucleotide and which is thought to promote frameshifting by 


X ——————————————————————————————————— 
FIGURE 2. (A) Different forms of BEV particles visualized in ultrathin sections of BEV-infected 
equine dermis cells. On the right, electron micrographs of BEV particles are shown; on the left, 
schematic interpretations of the viral structures seen in the corresponding photographs are pre- 
sented. Section planes "1" and "2" cut the nucleocapsid twice and once, respectively (Weiss and 
Horzinek, 1987). (B) Schematic representation of the architecture of BEV, the torovirus prototype. 
The localization and sizes of the structural proteins and genome are indicated (Snijder and 
Horzinek, 1993). 
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FIGURE 3. Genome organization and expression of BEV. The location of the 6 ORFs is indicated. 
The dashed lines indicate the ORF la region that remains to be sequenced (estimated size: 10 to 12 
kb). The lower part of the figure shows the relationship of the genome and the nested set of mRNAs 
generated in infected cells (Snijder and Horzinek, 1993). 


stalling the ribosome at the frameshift site, can be predicted for BEV (Fig. 5). 
They differ in the size of loop 2 (L2], which is either 11 (PK1) or 69 nt (PK2). The 
BEV frameshift signal was tested in vitro and in vivo using a reporter gene 
construct. Frameshift efficiencies between 20 and 30% were observed. A dele- 
tion mutation which obviates the posibility of a PK2 structure in the tran- 
scribed RNA did not influence the frameshift efficiency, suggesting that it, in 
fact, is PK] that is involved in the regulation of ORF 1b expression (Snijder et al., 
1990a]. 


V. DI PARTICLES AND RNAs OF BEV 


DI RNAs were readily generated during undiluted passage of BEV in embry- 
onic mule skin cells (Snijder et al., 1991b]. The sequence of two small BEV DI 
genomes, which arose and interfered strongly with viral replication within five 
undiluted passages, has been determined. A DI RNA of about 1 kb was found to 
contain about 600 nt from the 5' end of the viral genome and 242 nt [excluding 
the poly(A) tail] from the 3’ end. A somewhat larger DI RNA (1.4 kb) contained 
larger parts from the same segments of the genome (about 700 and 441 nt, 
respectively) and an additional central part of 100 nt, which is thought to be 


TABLE I. Characteristics of BEV RNAs and ORFs 








Relative Position Number Calculated 
Size molarity Encoded ^ upstream of of aa size 

RNA (kb) (96) ORF 3' end encoded (kDa) Protein 

1 25-30? 2 la ?-13,468 ? 2 Replicase 

lb 13,477-6,604 2291 261 Replicase 

2 6.9 3 2 6,684-1,941 1581 178 Spike 

3 2.1 30 3 1,909-1,210 233 26 Membrane 

4 1.4 13 4 1,150-724 142 16 Pseudogene? 

5 0.8 52 5 680-200 160 18 Nucleocapsid 
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FIGURE 4. Alignment of putative BEV “core promoter” sequences (indicated with asterisks} and 
their flanking context. Fully conserved nucleotides are boxed. The sequences are derived from 
sequence analysis of the genome (Snijder et al, 1990c) and of DI RNAs (Snijder et al., 1991b) of BEV 
(Snijder and Horzinek, 1993). 


derived from the ORFla region, since it does not match any of the known BEV 
sequences. 

The composition of the 1-kb DI genome suggests that the minimal se- 
quences required for BEV RNA replication (and probably also packaging) are 
located in two small domains present at the termini of the genomic RNA. This 
suggests a difference with members of the genus coronavirus. The generation 
and construction (at the cDNA level) of MHV DI genomes that are both repli- 
cated and packaged has been described, and DI-based systems are currently 
being used to study viral transcription and replication (Makino 1990, 1991; van 
der Most et al., 1991, 1992). However, all viable MHV DI RNAs contain at least 
three segments from the viral genome. In addition to 5' and 3' sequences, 
sequences from ORFIb, which are assumed to contain a packaging signal, are 
required for efficient propagation of MHV DI genomes (Makino et al., 1990, 
1991; Van der Most et al., 1991). 

Although BEV DI genomes and particles were initially generated and used 
as a tool to study viral replication, they also provide information on the previ- 
ously decribed heterogeneous nature of BEV preparations (Weiss et al., 1983, 
Snijder et al., 1988). A second peak of virus-specific material, sometimes re- 
ferred to as 50S particles, has been observed in both isokinetic and isodensity 
gradients. Ultracentrifugation experiments, using virus stocks containing the 
1-kb DI RNA, confirmed that this second peak represented particles that con- 
tained the DI genome and displayed both smaller S values (50-100S instead of 
400S) and lower densities (1.07—1.11 g/ml vs. 1.16 g/ml in sucrose) than standard 
virions. However, the protein composition of BEV DI particles remains to be 
studied in detail, especially since preliminary data suggested that both the S 
protein and the N protein were absent, or present only in very small quantities 
(Snijder et al., 1991b). 


VI. THE BERNE VIRUS REPLICASE GENE 


In addition to the discovery that the genome organization and expression 
strategy of toroviruses are very similar to those of coronaviruses, sequence 
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analysis of the BEV ORFIb region also provides conclusive evidence for an 
ancestral relationship between the two virus groups (Snijder et al., 1990a]. 

Two of the three basic enzymatic activities that are usually encountered in 
the replicase proteins of positive-stranded RNA viruses (polymerase, helicase, 
and protease] are encoded by the ORFIb sequence of toro- and coronaviruses 
(Figs. 6 and 7), namely, the polymerase (or SDD) domain (Poch et al., 1989) and 
the NTP-dependent helicase domain (Gorbalenya and Koonin, 1993). On the 
basis of the sequence analysis shown in Fig. 7, the percentage of identical amino 
acids (aa) in toro- and coronavirus polymerase and helicase domains can be 
calculated at 40 to 45% (70-90% between members of the genus coronavirus). 
This analysis clearly separates the toro- and coronavirus sequences from the 
corresponding domains of other virus (super)groups and also illustrates the 
relatively large evolutionary distance between the two genera of the family 
Coronaviridae. 

In addition to the "universal" domains described above, a number of less 
striking sequence similarities, which nevertheless appear to be Coronaviridae- 
specific, have been detected. The most remarkable example of such a domain is 
found at the C-terminus of the ORF 1b protein (Snijder et al. 1990a) (Figs. 6 and 
7). The last 300 aa of the ORF 1b product are approximately 4096 identical when 
toroviruses and coronaviruses are compared. Additional smaller conserved mo- 
tifs are a cysteine/histidine-rich domain, located between the polymerase and 
helicase regions, and a short region of the ORFla protein, just upstream of the 
ribosomal frameshift site (Snijder et al., 1990a, 1991a]. 

Several putative protease domains, belonging to different protease super- 
groups, have been identified in the ORFla proteins of the coronaviruses avian 
infectious bronchitis virus (IBV), MHV, and human coronavirus (HCV) 229E 
(Gorbalenya et al., 1989; Lee et al., 1991; Herold et al., 1993). Unfortunately, 
only short sequences from both ends of the BEV ORF la protein are available and 
these do not contain any obvious protease domains. Therefore, it remains to be 
seen whether the proteolytic processing pattern of the torovirus replicase fol- 
lows the examples provided by the corresponding proteins of coronaviruses and 
arteriviruses (Snijder and Horzinek, 1993). For these two virus groups, a pro- 
tease belonging to the chymotrypsin/3C-like supergroup is predicted to be the 
main protease, responsible for the processing of the C-terminal part of the 
ORFla protein and the ORFIb polypeptide. Additional protease domains, be- 
longing to the papainlike family of proteolytic enzymes, direct the processing of 
the N-terminal part of the ORFla product (Snijder et al., 1992; Baker et al., 
1993). 


VII. THE STRUCTURAL PROTEINS OF TOROVIRUSES 


Torovirus proteins were initially analyzed by metabolic labeling of BEV- 
infected cells (Horzinek et al., 1984). Subsequent sequence analysis of the 
structural genes and characterization of their products identified three struc- 
tural proteins: a 19-kDa nucleocapsid [N] protein, a 26-kDa integral membrane 
protein, and a 180-kDa spike protein [which is posttranslationally cleaved into 
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two subunits). These proteins were assigned to ORFs 5, 3, and 2, respectively, in 
the BEV genomic sequence (Fig. 3) (Den Boon et al., 1991a; Snijder et al., 1989, 
1990b). Although the small-membrane protein and the spike protein were ini- 
tially termed E (for envelope) and P (for peplomer), they have now been renamed 
to M and S to follow the coronavirus protein nomenclature (Snijder and 
Horzinek, 1993). 

Since BRV cannot be grown in cultured cells, its protein composition was 
studied by means of surface radioiodination of purified virus (Koopmans et al., 
1986). Possible virus-specific polypeptide species of 20, 37, 85, and 105-kDa 
were identified in this manner. 
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FIGURE 6. (A] Proportional dot matrix comparison of the amino acid sequences of the predicted 
ORF 1b products of the torovirus BEV and the coronavirus IBV (Snijder et al. 1990a). (B) Position of 
conserved domains in the replicases of the torovirus BEV and the coronaviruses IBV (Beaudette 
M42), MHV (strain A59), and HCV 229E: a, papainlike protease; b, 3C-like protease; c, polymerase; 
d, cysteine/histidine-rich domain; e, helicase; f, C-terminal Ib domain. Sequence comparisons of 
domains c, e, and f are presented in Fig. 7. 


A. The N Protein 


A. 19-kDa protein was present in purified BEV nucleocapsids that were 
obtained by detergent treatment of purified virus particles (Horzinek et al., 
1985). This N protein is the most abundant structural protein of the BEV 
particle, accounting for about 8096 of its protein mass. It is a phosphorylated 
protein with RNA-binding properties (Horzinek et al., 1985). By in vitro transla- 
tion studies using BEV RNA 5 (Snijder et al., 1988) and sequence analysis of the 
region upstream of the poly(A) tail (Snijder et al., 1989) the N protein gene was 
identified as the 3'-terminal gene in the BEV genome. 

The N protein consists of 160 aa. It contains 7 (4%) acidic and 22 (14%) basic 
amino acid residues. The latter are especially clustered in a central domain of 
the N protein sequence: 15 basic residues are found between aa 34 and 80 
(Snijder et al., 1989). | 

The size of the BEV N protein (18.3kDa] is noticeably smaller than that of 
the corresponding protein of coronaviruses (45-55kDa) (Spaan et al., 1988). 
Apparently, the different architecture of the torovirus nucleocapsid, which is 
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tubular instead of the helical core found in the genus coronavirus, dictates the 
use of a different type of N protein. Therefore, it is not surprising that no 
significant sequence similarities are detected in the N proteins of toro- and 
coronaviruses are compared. 


B. The M Protein 


The BEV M protein, which is translated from ORF 3 (Snijder et al 1988), is 
an unglycosylated polypeptide (Horzinek et al 1986), accounting for about 1396 
of the virion protein mass (Horzinek et al 1985). The M protein is a 26.5-kDa 
polypeptide of 233 aa. It does not contain an N-terminal signal sequence. In its 
N-terminal half, the three membrane-spanning a-helixes that are so charac- 
teristic for coronavirus M proteins are encountered (Fig. 8) (Den Boon et al., 
1991a). The small difference between the calculated and observed sizes of the M 
protein in sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS- 
PAGE] is explained by aberrant migration, probably due to the extreme hydro- 
phobicity of the protein. 

The membrane topology of the M protein has been studied by in vitro 
translation using the M protein itself and a hybrid protein which contained a 
C-terminal tag (Den Boon et al., 1991a]. After in vitro translation in the presence 
of microsomes, about 85% of the M protein was resistant to protease K diges- 
tion. Therefore, its disposition in the membrane is thought to be very similar to 
that of the triple-spanning M proteins of coronaviruses such as MHV and IBV. 
The N-terminus is located in the lumen of the endoplasmic reticulum (and 
eventually at the virion surface], a large central part of the protein is embedded 
in the membrane, and the C-terminus is located at the cytoplasmic face of the 
endoplasmic reticulum. One of the hydrophobic transmembrane domains is 
assumed to function as an internal signal sequence. The BEV M hybrid carrying 
the C-terminal tag accumulated in intracellular membranes during transient 
expression experiments. Thus the torovirus M protein may play a role in the 
intracellular budding process, as has been suggested for its coronavirus counter- 
part (Dubois-Dalcq et al., 1982; Holmes et al., 1981; Rottier et al., 1981). 

Despite the striking similarities in size, structure, and functional charac- 
teristics, no significant primary sequence similarities can be detected between 
the M protein sequences of BEV and coronaviruses (Den Boon et al., 1991a). Still, 
the observed similarities are taken as evidence for a process of divergent evolu- 
tion during which the properties essential for virus assembly have been con- 
served. 


C. The S Protein 


The BEV envelope is studded with drumstick-shaped projections that mea- 
sure about 20 nm in length (Weiss et al., 1983) (Fig. 1). In early torovirus studies, 
the heterogeneous 75- to 100-kDa protein material was assumed to represent 
the viral spike protein(s], since it was recognized by both neutralizing and 
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FIGURE 8. Comparison of the hydrophobicity profiles of the M proteins of the torovirus BEV and 
the coronavirus MHV. Free energy values corresponding to the insertion of successive 21-aa protein 
segments from an aqueous environment into a lipid bilayer. Calculations were done according to 
Von Heijne (1981) and plotted as a function of the first amino acid residue in the segment (den Boon 
et al., 19913). 


hemagglutination-inhibiting monoclonal antibodies (Horzinek et al., 1986; 
Kaeffer et al., 1989). Pulse-chase labeling experiments revealed that the 75- to 
100-kDa material is derived from a 200-kDa precursor that is found in infected 
cells, but not in virus particles (Horzinek et al., 1986). Extensive N-glycosyla- 
tion and proteolytic cleavage of the precursor are part of the posttranslational 
processing of the torovirus S protein. The cross-reactivity between different 
BRV “serotypes” in radioimmunoprecipitation indicates that the 85- and 105- 
kDa protein species, which were identified in radioiodination studies, also 
probably represent the surface structures (Koopmans et al., 1986). 

The backbone of the BEV S protein is an apoprotein of 1581 aa (M, about 178 
kDa) encoded by ORF2. Among the features it shares with the S proteins of 
other coronaviruses are a large number (18) of potential N-glycosylation sites, 
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an N-terminal signal sequence, a putative C-terminal transmembrane anchor, 
two putative heptad repeat domains, and a possible cleavage site for a "trypsin- 
like" protease (Fig. 9). The mature S protein consists of two subunits (S1 and S2, 
according to the coronavirus protein nomenclature). Endoglycosidase F diges- 
tion of SI and S2 strongly suggests that the S protein is, indeed, cleaved at the 
predicted cleavage site, which consists of five consecutive arginine residues 
(Snijder et al., 1990b). As postulated previously for other S proteins (De Groot et 
al., 1987), the heptad repeats in S2 are assumed to be involved in the generation 
of an intrachain coiled-coil secondary structure. Interchain interactions of the 
same kind may be involved in protein oligomerization. In a sucrose gradient 
assay the BEV S protein was shown to form dimers (Snijder et al., 1990b). The 
intra- and interchain coiled-coil interactions may stabilize the elongated spikes 
of all Coronaviridae. 


VIII. INDICATIONS FOR RECOMBINATION DURING 
TOROVIRUS EVOLUTION 


Although all Coronaviridae display the same basic gene order (replicase, S, 
M, N), the number and location of additional reading frames in the genomic 
RNA is quite variable (Spaan et al., 1988). RNA recombination, which is consid- 
ered an important factor in RNA virus evolution (Strauss and Strauss, 1988; 
Goldbach and Wellink, 1988; Lai, 1992), is the most attractive explanation for 
the observed genetic heterogeneity. This idea is supported by the fact that MHV 
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FIGURE 9. Schematic representation of the position of characteristic domains in the BEV peplomer 
protein. The positions of heptad repeats, hydrophobic segments, potential N-glycosylation sites and 
the putative cleavage site are indicated. In addition, a hydrophobicity plot of the amino acid 
sequence is presented. The plot was generated according to the method of Kyte and Doolittle (1982) 
using a window size of 21 (Snijder et al., 1990b]. 
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displays a high (homologous) recombination frequency (Makino and Lai, 1989; 
Spaan et al., 1988; Lai, 1990, 1992), which may be a consequence of the replica- 
tion strategy and replicase properties of coronaviruses. 

Although only the BEV genome has been (partially) sequenced, the toro- 
viruses appear to follow the example of the coronaviruses. Thus, indications for 
two independent recombination events during BEV evolution have been ob- 
tained (Snijder et al., 1991a]. 

The first putative recombination involves BEV ORF 4. Its 142-aa sequence 
shares 30-35% identical amino acid residues with the C-terminal parts of both 
the influenza C virus (IVC) and the coronavirus hemagglutinin esterase (HE] 
protein sequences, which were previously found to be related by common 
ancestry and heterologous recombination (Luytjes et al., 1988; see also Chapter 
8, this volume). Since the sequences corresponding to the 5' two thirds of the 
HE gene are lacking in BEV, ORF 4 has been hypothesized to be a pseudogene 
(Snijder et al., 19914]. 

The second putative recombination event that has occurred during BEV 
evolution is quite similar to the case of ORF 4. The C-terminus of the BEV ORF 
la protein contains 31 to 36% identical amino acid residues compared with the 
N-terminal 190 aa of the nonstructural coronavirus protein, ns30/32kDa (Bre- 
denbeek et al., 1990; Cox et al., 1989). Like the HE protein gene, this ns protein 
is found only in the coronaviruses closely related to MHV (Snijder et al., 1991a], 
where it is located between the polymerase and HE genes (Fig. 10). Apparently a 
sequence related to the 5' two thirds of this coronavirus nonstructural gene has 
been integrated into ORF la of BEV and is now expressed as part of its replicase. 

Remarkably, both recombinations previously described associate toro- 
viruses with coronaviruses, in particular with the antigenic cluster to which 
MHV belongs, and it is clear that these sequence similarities must be the result 
of nonhomologous RNA recombination events. However, they do not imply 
that toroviruses are more closely related to the MHV cluster than to other 


ORF1a ORF4 





FIGURE 10. Schematic representation of the genome organization of the torovirus BEV and the 
coronaviruses MHV and IBV. The replicase gene (1a and 1b) and the structural genes S, M, and N are 
indicated. Filled (ORF 4/HE) and cross-hatched (ORF 1la/ns30kDa) boxes indicate homologous 
sequences of BEV and MHV that are thought to be derived from independent recombination events 
(Snijder and Horzinek, 1993). 
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coronaviruses. First, the homologous sequences are located at different posi- 
tions in the genome (Fig. 10). Second, the BEV replicase is not more closely 
related to the MHV replicase than to, e.g., that of IBV. Third, the sequence 
similarities between the S and M proteins of corona- and toroviruses are so low 
(see the previous paragraph) that the high similarities discussed above (73096 
identical residues) can only be explained if divergence between BEV ORF 4 and 
the coronavirus HE gene, and between the BEV ORF 1a fragment and the 
coronavirus ns30/32-kDa gene is a more recent event than the divergence of the 
other homologous genes of toro- and coronaviruses. 

Considering the fact that several extant representatives of both virus 
groups cause enteric infections, direct recombination between toro- and coro- 
naviruses during coinfection of the same cell is feasible. However, the involve- 
ment of a third party of viral or cellular origin cannot be excluded. 


IX. CONCLUDING REMARKS 


The history of torovirus research illustrates the impact of molecular virol- 
ogy on the taxonomy of viruses. The taxonomic career of the toroviruses—at 
first proposed as a new family, then a free-floating genus, and finally a genus in 
the Coronaviridae—was guided by increasingly detailed knowledge of toroviral 
genes and proteins. 

The BEV genome has turned out to be a showcase for the two driving forces 
in RNA virus evolution: divergence from a common ancestor and RNA recom- 
bination. Traces of these processes are revealed in the primary protein sequence 
homologies in the replicase, conserved structural properties (in spite of diverged 
primary sequences) in the S and M proteins, the presence of apparently unre- 
lated N proteins coupled to a clearly different nucleocapsid architecture, and 
finally the recombination events discussed in the previous paragraph. Thus, the 
comparative analysis of the genomes and proteins of toroviruses and corona- 
viruses, supplemented with the data from the molecular characterization of the 
arteriviruses (see Chapter 12, this volume), has increased our understanding of 
the replication and evolution of all three virus groups. 
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CHAPTER 12 


The Coronaviruslike 
Superfamily 


Eric J. SNJDER AND WILLY J. M. SPAAN 


I. INTRODUCTION 


Until recently, the Coronaviridae was classified as a monogeneric family of 
closely related viruses. However, in the past four years, it has become evident 
that similarities in the genome organization, replication strategies, and nucleo- 
tide sequences of coronaviruses, toroviruses, and arteriviruses, require a revi- 
sion of this taxonomy. The “superfamily” concept (Strauss and Strauss, 1988; 
Goldbach and Wellink, 1988), which is based on evolution and phylogeny and 
which has already closed the gaps between other virus groups (e.g., the alpha- 
viruslike and picornaviruslike superfamilies}, can now also be applied to a 
group of "coronaviruslike viruses." 

The sequence analysis of the genomes of the coronaviruses infectious 
bronchitis virus (IBV) (Boursnell et al., 1987) and mouse hepatitis virus (MHV} 
(Bredenbeek et al., 1990; Lee et al., 1991), the Berne torovirus (BEV) (Snijder et 
al., 1990a,b|, and the (at the time] “unclassified togavirus” equine arteritis virus 
(EAV) (Den Boon et al., 1991b) revealed unexpected evolutionary links. The 
common features of these viruses are centered around the coronaviruslike 
replicase gene and its associated replication and expression strategy. Conse- 
quently, the corona- and toroviruses have now been formally joined together in 
the family Coronaviridae (Chapter 11, this volume}, and the International Com- 
mittee on the Taxonomy of Viruses has recently created a new study group to 
establish a virus family that will comprise the members of the presently free- 
floating genus arterivirus: EAV (the prototype of the genus), lactate dehydro- 
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genase-elevating virus (LDV), simian hemorrhagic fever virus (SHFV), and the 
recently discovered porcine reproductive and respiratory syndrome virus 
(PRRSV; also known as Lelystad virus). The question of how the evolutionary 
links between the Coronaviridae and arteriviruses should be reflected in their 
taxonomic status is still open to discussion (Snijder and Horzinek, 1993; Cava- 
nagh et al., 1994). 

The biological properties of EAV, LDV, and SHFV have recently been re- 
viewed by Plagemann and Moennig (1992). The morphological characteristics 
and genome size of arteriviruses (12-15 kb) are most comparable to those of the 
Togaviridae. However, as will be described, the arterivirus replicase and replica- 
tion strategy are strikingly similar to those of the Coronaviridae. In this short 
comparative chapter, we will briefly summarize the molecular characteristics 
of the arteriviruses and focus on the most important similarities and differences 
with the Coronaviridae. 


II. ARTERIVIRUSES 


EAV was first isolated from a fetus aborted during an endemic disease 
outbreak in pregnant mares (Doll et al., 1957). The sequence of the 12.7-kb 
positive-stranded, polyadenylated EAV genome was published in 1991 by Den 
Boon et al. (1991b). Two years later, the molecular characterization of PRRSV of 
swine (Meulenberg et al., 1993a) and LDV of mice (Godeny et al., 1993a) re- 
vealed slightly larger genome sizes: 15.1 and 14.2 kb, respectively. At the IXth 
Congress of Virology in Glasgow, 1993, the first sequence data from the SHFV 
genome were reported and confirmed its close relationship to the other three 
arteriviruses (Godeny et al., 1993b). 

Due to their morphology, EAV and LDV were initially classified as mem- 
bers of the togavirus family. The spherical arterivirus particle has a diameter of 
50 to 70 nm (Hyllseth, 1973) and is comprised of an icosahedral core structure of 
35 nm surrounded by an envelope carrying ringlike structures with a diameter 
of 12 to 15 nm (Horzinek et al., 1971). The identification and characterization of 
four structural EAV proteins has been reported recently (de Vries et al 1992): a 
12-kDa nucleocapsid (N) protein, an unglycosylated 18-kDa transmembrane 
protein (M), a 25-kDa glycoprotein G,, and a second glycoprotein, G,, which, 
due to heterogeneous glycosylation, has sizes between 30 and 42 kDa. The 
current model of the EAV particle is shown and compared with the Corona- 
viridae in Fig. 1. It is clear that the nucleocapsid architecture (a classic trait for 
viral taxonomy, with the same ranking as nucleic acid type or the presence of an 
envelope), is different for arteriviruses (icosahedral) and Coronaviridae (heli- 
cal). Another important difference is the fact that the EAV envelope does not 
bear the elongated spikes which are so characteristic of the Coronaviridae. 


III. ARTERIVIRUS GENOME ORGANIZATION AND EXPRESSION 


The arterivirus genome is a polycistronic RNA that contains 8 open read. 
ing frames (ORFs) in the case of EAV, PRRSV, and LDV (Fig. 2). Like Corona- 
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FIGURE 1. Schematic representation of the structure of BEV (a torovirus), EAV (an arterivirus), and 
MHV (a coronavirus}. The major structural proteins of each virus group are illustrated. 


viridae, arteriviruses produce a 3'-coterminal nested set of viral mRNAs in the 
infected cell. These mRNAs range in size from genome length to about 0.7 kb 
and are used to express internally located ORFs (de Vries et al., 1990; Kuo et al., 
1991; Meulenberg et al., 1993a). Viral subgenomic (sg) RNAs are composed of 
leader and body sequences that are not contiguous on the genome, another 
striking similarity with coronaviruses (but possibly not with toroviruses; see 
Chapter 11, this volume]. 

The EAV ORFs 2, 5, 6, and 7 have been shown to EE the structural G,, 
G , M, and N proteins, respectively (de Vries et al., 1992). The structural charac: 
teristics of the products of ORFs 3 and 4 are typical of membrane proteins, but 
no information on the function of these proteins has been obtained so far. 

The arterivirus replicase gene is between 9.7 (EAV) and 11.6 (PRRSV) kb in 
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FIGURE 2. Genome organization of MHV (a coronavirus}, BEV (a torovirus), and EAV (an arteri- 
virus). The genes encoding the viral replicases and major structural proteins are illustrated. 
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size and is comprised of two ORFs (la and 1b] that are expressed from the 
genomic RNA. As in the case of Coronaviridae, the arterivirus ORF 1b is 
probably expressed by ribosomal frameshifting during the translation of ge- 
nomic RNA (den Boon et al., 1991b). “Shifty” sequences (5, UUUAAAC 3’ for 
LDV and PRSSV; 5’ GUUAAAC 3' for EAV] and a downstream RNA pseudo- 
knot structure are thought to form the frameshift signals (Jacks et al., 1988; 
Brierley et al., 1989; Ten Dam et al., 1990). The ORF la/ORF 1b frameshift 
mechanism and the RNA structures involved in this process are remarkably 
conserved in corona-, toro, and arteriviruses, indicating that translational 
frameshifting is an ancient and probably essential regulatory step in replicase 
gene expression. 


IV. mRNA SYNTHESIS 


The generation of a 3'-coterminal nested set of mRNAs from a nonseg- 
mented genome distinguishes the members of the coronaviruslike superfamily 
from other groups of positive-stranded RNA viruses. The common ancestry of 
corona-, toro-, and arteriviral replicases (see Section V} suggests that this expres- 
sion strategy is related to the properties of the coronaviruslike replicase and 
that similar (in fact, homologous) transcription mechanisms may be used by 
these viruses. 

An early analysis of EAV mRNA transcription by means of UV transcrip- 
tion mapping suggested that all sg RNAs are derived from the processing of a 
genome-length precursor molecule (van Berlo et al., 1982), a situation that 
would be very different from the independent transcription of coronavirus and 
torovirus mRNAs, which was deduced using the same experimental approach 
(Jacobs et al., 1981; Snijder et al., 1990b}. To resolve this point, the UV transcrip- 
tion mapping analysis of both EAV and MHV at a late time point in infection 
has recently been repeated (Den Boon et al., 1995b], and in contrast to the 
observations of van Berlo et al. (1982), a clear correlation between UV target size 
and physical size for the sg RNAs of EAV was established. However, a quantita- 
tive analysis indicated that for both arteriviruses and coronaviruses this cor- 
relation does not justify the conclusion that the sg RNA transcription is fully 
independent from the synthesis of the genomic RNA (Den Boon et al., 1995b). 
Nevertheless, splicing can be ruled out as the major mechanism for arterivirus 
sg RNA production late in infection. 

The comparative analysis of mRNA transcription of corona- and arteri- 
viruses was extended by preparing replicative form (RF) RNA in order to study 
the EAV minus strands (Den Boon et al., 1995c). As in coronavirus-infected cells 
(Sethna et al., 1989; Sawicki and Sawicki, 1990), sg RFs were detected, suggest- 
ing that complements of both genomic and sg positive strands are generated. 
However, it remains to be excluded that the sg negative strands of EAV are 
derived from cleavage of RNase-sensitive sites in a genome-length RF. For a 
discussion on the possible role of the sg minus strands of coronaviruses, the 
reader is referred to Chapter 2 (this volume}. 

Both corona- and arteriviral mRNAs contain a leader sequence at their 5' 
end that is identical to the 5' end of their respective genomes (de Vries et al., 
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1990). Thus the generation of corona- and arterivirus subgenomic mRNAs must 
involve a process of discontinuous transcription. The arterivirus leader RNA 
has between 170 and 210 nucleotides and is, therefore, two to three times larger 
than that of coronaviruses. For EAV, LDV, and PRRSV, conserved AU-rich 
sequences have been identified at the leader-to-body junction (LBJ) sites of the 
mRNAs (Fig. 3) (de Vries et al. 1990; Den Boon et al., 1995c; Chen et al., 1993; 
Meulenberg et al., 1993b). Coronavirus genomes contain similar conserved AU- 
rich boxes (known as intergenic sequences) at the fusion sites of leader and 
mRNA bodies (reviewed by Spaan et al 1988; Lai, 1990). However, since most 
of the arterivirus ORFs 2 to 6 overlap to some extent, the coronavirus term 
“intergenic sequence" is inappropriate for arteriviruses. Although no evidence 
for a common leader sequence in torovirus mRNAs has been obtained, a con- 
served AU-rich sequence, which is thought to function as a promoter sequence 
for mRNA transcription, is also found upstream of ORFs 2 to 5 of the Berne 
virus (Fig. 3) (Chapter 11, this volume). 

In view of its relatively small size, it is, perhaps, not surprising that the 
arterivirus LBJ sequences are found at sites not involved in mRNA synthesis. 
The EAV 5' UCAAC 3' LBJ motif, for example, occurs 18 times in the genomic 
sequence. Thus, it is evident that this sequence element cannot be the sole 
determinant of mRNA transcription. 

The role of the LBJ motif in mRNA synthesis has not yet been fully 
elucidated. As in coronaviruses, the extreme 3' end of the arterivirus leader 
sequence is identical to the LBJ sequence. Thus, it is possible that, in the 
arterivirus minus strand, the complements of LBJ sites may function as mRNA 
promoters in a discontinuous leader-primed transcription mechanism involv- 
ing basepairing between the leader and the promoter region. This process may 
correspond to the model that has been put forward for coronavirus mRNA 
transcription (Baric et al., 1983; Lai et al., 1983; Spaan et al., 1983). A basepairing 
interaction between leader and mRNA promoter sequences on the negative 
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strand is still the most attractive explanation for the discontinuous step dur- 
ing coronavirus mRNA transcription. Since the coronavirus system has been 
studied more extensively, we would refer the reader to Chapter 2 (this volume) 
for a more detailed discussion of this subject. 


V. THE CORONAVIRUSLIKE REPLICASE 


Genome replication is a fundamental process in the viral life cycle and viral 
replicase genes are at the hub of viral biology. Replicase proteins are conserved 
among seemingly disparate groups of plant and animal RNA viruses (Strauss 
and Strauss, 1988; Goldbach and Wellink, 1988), and this evolutionary link is 
clearly apparent in the replicases of the coronaviruslike (CVL) superfamily. The 
comparison of the replication strategy and replicase properties of corona-, toro-, 
and arteriviruses has clearly distinguished the CVL replicase module from 
those of alpha, picorna, and flaviviruses. The organization of the CVL replicase 
in two ORFs, expression of the gene by ribosomal frameshifting, and the ar- 
rangement of conserved domains within the gene product are unique. 

Although arterivirus replicase genes are considerably smaller (9.5—12. kb] 
than their toro- and coronavirus counterparts, they contain a number of con- 
served domains that are present in the same relative positions (Den Boon et al., 
1991b). The sequence alignments (Fig. 4) of these domains reveal up to 30% 
amino acid sequence identity in the most conserved regions, a percentage that 
cannot be due to convergent evolution. The conservation of two of these do- 
mains (polymerase and helicase), which are common to all positive-stranded 
RNA viruses (Poch et al., 1989; Gorbalenya and Koonin, 1993), is not very 
surprising, their presence indicates that these viruses have probably all de- 
scended from the same RNA virus prototype. It is remarkable, however, that 
only in CVL replicases the helicase domain is located downstream of the 
polymerase motif. The polymerase motif also carries another CVL replicase 
trademark: the substitution of the classic GDD in the core of the motif by an 
SDD. Also, the conservation of additional replicase domains, for example, the 
carboxyl-terminal ORF 1b domain, for which no homologue can be found in 
other viral replicases, clearly indicates that the CVL replicases are more related 
to each other than to any other group of positive-stranded RNA viruses. 


VI. PROTEOLYTIC PROCESSING OF THE CVL REPLICASE 


Proteolytic processing of nonstructural proteins fulfills a key role in the life 
cycle of most viruses. In the course of virus evolution, highly specific virus- 
encoded proteases have evolved, and their importance for the regulation of virus 
replication is becoming more and more evident. For detailed information on 
this topic, the reader is referred to an excellent review by Dougherty and Semler 
(1993). 

CNL replicase gene expression leads to the production of an ORF la/1b 
fusion protein that is large in the case of arteriviruses (345-420 kDa] (Den Boon 
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et al., 1991b; Meulenberg et al 1993a; Godeny et al., 1993a) and extremely large 
in coronaviruses (740—810 kDa) (Boursnell et al., 1987; Lee et al., 1991; Herold et 
al., 1993) and probably also in toroviruses. The presence of multiple putative 
protease domains in the ORF 1a proteins of both coronaviruses (Gorbalenya et 
al., 1989a; Lee et al., 1991; Herold et al 1993) and arteriviruses (Den Boon et al., 
199]b; Meulenberg et al., 1993a; Godeny et al., 1993a) suggests that extensive 
proteolytic processing is involved in the regulation of CVL replicase function. 
This assumption is supported by the fact that the characterization of temperature- 
sensitive mutants of MHV revealed the presence of at least five RNA - comple- 
mentation groups (Schaad et al., 1990; Baric et al., 1990). 

A number of replicase cleavage products have recently been detected in 
MHV-infected cells (Denison et al., 1992) and in in vitro translation reactions 
programmed with genomic RNA (Denison et al., 1991). The preliminary pro- 
cessing scheme of the EAV ORF 1a protein comprises at least five proteolytic 
cleavages carried out by three different viral proteases (Snijder et al., 1994a). 
Combined with cleavage site predictions based on the putative protease types 
encountered in corona- and arteriviruses (Gorbalenya et al., 1989a; Lee et al., 
1991; Godeny et al 1993a], these experimental data forecast the generation of at 
least 10 (and possibly up to 20) cleavage products from the CVL ORF 1la/1b 
protein. If this estimate is correct, the number of intermediary processing 
products could be very large. This implies that the CVL replicase processing 
may be very complex, especially since the processing analysis of other viral 
replicases has revealed that intermediary products can be functional subunits 
themselves, e.g., in the replication of poliovirus (Jore et al., 1988; Ypma-Wong 
et al., 1988) and Sindbis virus (de Groot et al., 1990; Strauss and Strauss, 1990; 
Shirako and Strauss, 1994). 

The functional characterization of the proteases in the CVL ORF la se- 
quences, which belong to different protease supergroups, has only just begun. 
However, a general pattern for coronaviruses and arteriviruses seems to emerge 
from comparative replicase sequence analysis. In comparable positions, the 
ORF la proteins of both virus groups contain a domain belonging to the protease 
superfamily that comprises the chymotrypsinlike and picornavirus 3C-like 
proteolytic enzymes (Fig. 5) (Gorbalenya et al., 1989b; Bazan and Fletterick, 
1988). Although the predicted catalytic nucleophile of the coronavirus protease 
(Cys) differs from that in the arterivirus proteases (Ser], this domain may still be 
a remnant from a common ancestor of both virus groups. The exchange of Cys 
for Ser at the active site of the enzyme is considered to be feasible (Gorbalenya 
et al., 1989b; Bazan and Fletterick, 1988, 1990; Dougherty and Semler, 1993). 

The functions encoded from the central region of ORF la to the 3'-end of 
ORF 1b appear to be the core of the CVL replicase polyprotein: the well- 
conserved domains (protease-polymerase-helicase-C-terminal “unique” ORF 
1b domain) are within this area, and only small insertions and deletions in this 
part of the replicase can be detected within the coronavirus or arterivirus groups 
(Fig. 5). The chymotrypsin/3C-like protease is most likely to be responsible for 
the processing of the core replicase. 

The N-terminal half of the ORF 1a protein, on the other hand, is quite 
variable: the variability of the replicase gene in this area is largely responsible 
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FIGURE 5. Scaled representation of the replicase gene organization of coronaviruses, toroviruses, 
andarteriviruses. The filled circle indicates the ORF la/1b frameshift site. The largest replicase gene 
(MHV) is about 22 kb, the smallest one (EAV) approximately 9.5 kb. Boxes represent conserved 
domains: PCP, papainlike cysteine protease; 3CCP, 3C-like cysteine protease; CSP, chymotryp- 
sinlike serine protease; POL, polymerase; HEL, helicase; CTD, carboxyl-terminal ORF 1b domain. 


for the size differences in corona- or arterivirus genomes. A comparison be- 
tween the coronavirus and arterivirus N-terminal ORF la protein sequences 
does not yield any significant similarities, and even within the coronavirus and 
arterivirus groups there is little conservation in this region. 

However, one striking observation can be made. Both corona- and arteri- 
viruses contain putative protease domains in the amino-terminal half of their 
ORF la proteins. These proteases have been shown, or are predicted, to belong 
to the papainlike cysteine protease superfamily (Fig. 5) (Gorbalenya et al., 1991). 
A number of these proteases have recently been characterized. In EAV (Snijder 
et al., 1992) and MHV (Baker et al., 1993), a papainlike protease is responsible for 
the production of an amino-terminal ORF 1a cleavage product. Furthermore, 
the production of the next amino-terminal cleavage product from the EAV ORF 
la protein has now been shown to be mediated by an autoproteolytic activity 
that is most similar to papainlike proteases (Snijder et al., 1995). Two other 
arteriviruses, PRRSV and LDV, have been found to use even a third, papainlike 
protease in the processing of the N-terminal region of the ORF la protein (Den 
Boon et al., 19952). During arterivirus evolution, EAV appears to have lost the 
protease function that is responsible for the production of an additional amino- 
terminal cleavage product in PRRSV and LDV (Fig. 5]. 

Unfortunately, only the 5'- and 3'-sequences of the Berne torovirus ORF la 
region (about 1 kb from each end) have been determined so far and no putative 
proteases have been identified in these regions. Their future identification 
would be very useful, especially for a comparative analysis of the CVL 
chymotrypsin/3C-like proteases. 
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VII. STRUCTURAL PROTEINS 


Viral structural proteins are known to diverge at a higher rate than non- 
structural proteins. This is exemplified by the relationship between the corona- 
and torovirus replicase, S, and M proteins (Chapter 11, this volume), a set of 
genes that is thought to be related by common ancestry. Whereas, undisputable 
sequence homologies are present in the replicase proteins of these viruses, these 
are absent in the S and M protein sequences, which only display structural 
similarities. The corona- and torovirus N proteins, on the other hand, are so 
different in size and sequence that common ancestry is highly unlikely. Also, 
the nucleocapsid structures that they form together with the genomic RNA are 
significantly different. It therefore has been proposed that during evolution 
either the coronavirus or the torovirus group may have acquired a novel N 
protein gene (Snijder and Horzinek, 1993). 

When the arteriviruses are included in the comparison of the structural 
proteins of CVL replicase-containing viruses, it is hard to detect any similarity 
at all, either in size, sequence, or structural characteristics (Den Boon et al., 
1991b; de Vries et al., 1992). The N protein (12kDa) is even smaller than that of 
toroviruses and gives rise to an icosahedral nucleocapsid. The presently known 
surface glycoproteins G, and G, are small compared to the S proteins of corona- 
and toroviruses. Only the presence of a triple-spanning membrane (M) protein, 
encoded by arterivirus ORF 6, is similar to the Coronaviridae, even though this 
protein (18kDa) is also smaller than its coronaviral counterparts. The triple- 
spanning membrane proteins of corona- and toroviruses have been implicated 
to play a crucial role in determining the intracellular budding process of these 
viruses (Dubois-Dalcq et al., 1982, Holmes et al 1981; Rottier et al., 1981; Den 
Boon et al., 1991a). Since arteriviruses also assemble at intracellular mem- 
branes, the conservation or acquisition of an M protein with similar structural 
properties may have been expected during arterivirus evolution (de Vries et al., 
1992). 


VII. THE EVOLUTION OF THE CVL SUPERFAMILY 


The common ancestry of the CVL replicase proteins and the associated 
replication strategy are evident. The features that are shared by the members of 
the CVL superfamily are the basic genome organization, namely, "replicase- 
envelope protein-nucleocapsid protein," the production of 3'-coterminal 
nested sets of mRNAs, and the conserved organization of the replicase gene. 
However, the N proteins and nucleocapsid structures of the three CVL genera 
are apparently unrelated. Furthermore, the arterivirus surface proteins show no 
evidence of a common ancestry. 

The coupling of different sets of structural genes to the same replicase gene 
has been explained by the recombination of complete genes or gene sets (mod- 
ules) (Zimmern, 1987; Strauss and Strauss, 1988; Goldbach and Wellink, 1988). 
The RNA recombination frequency during MHV replication has been shown to 
be remarkably high (reviewed by Spaan et al., 1988; Lai, 1992), a characteristic 
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that may be determined by their replicase properties. The different sets of 
structural genes (and the varying number of "additional" genes) that are now 
known to be linked to the CVL replicase gene indicate that this characteristic 
may be shared by other members of the superfamily ("modular" evolution]. 
Direct evidence for multiple recombination events during torovirus evolution 
has already been obtained (Snijder et al., 1991). Together with divergent evolu- 
tion, a high recombination frequency can account for the diverse composition 
of CVL genomes. It has been hypothesized that the acquisition of a novel 
nucleocapsid protein gene, giving rise to a helical virus core rather than an 
icosahedral one, may have created the possibility for the corona/torovirus 
branch to diverge from the arterivirus branch (Godeny et al., 1993a). The in- 
creased genome size of the corona- and toroviruses could in this case be ex- 
plained by the more relaxed packaging constraints of a helical nucleocapsid 
structure. 

Arteriviruses and, especially, coronaviruses possess exceptionally large 
genomes in comparison with other positive-stranded animal RNA viruses. The 
apparent gap between the genome sizes of arteriviruses (EAV:12.7 kb) and coro- 
naviruses (MHV:31.5 kb) is slowly being reduced now that the sequences of 
larger arterivirus genomes (PRRSV:15.1 kb) and smaller coronavirus genomes 
[human coronavirus (HCV) 229E:27.3 kb| have been determined. A group of 
positive-strand RNA viruses that almost bridge the remaining gap are the plant 
closteroviruses, with genome sizes between 7 and 20 kb. The molecular biology 
properties of this virus group has recently been reviewed by Dolja et al. (1994). A 
number of remarkable similarities with the CVL group is found in the clostero- 
virus group: e.g., polycistronic genomes, frameshifting during replicase expres- 
sion, and the generation of a 3'-coterminal nested of mRNAs to express the 3’ 
proximal genes. Although phylogenetic studies, including closterovirus and 
CVL replicase sequences, indicated that these similarities are analogies rather 
than homologies (Koonin and Dolja, 1993], this example of convergent evolu- 
tion may elucidate some general principles governing the evolution of viruses 
with large RNA genomes. The size variation and plasticity of both closterovirus 
and CVL genomes is remarkable, suggesting again the frequent involvement of 
recombination during the evolution of these virus groups (Dolja et al., 1994). 
The process of recombination could also partially compensate for the general 
high error frequency of viral RNA-dependent RNA polymerases, a feature that 
will greatly influence the stability and evolution of large RNA genomes. 


IX. THE TAXONOMY OF CVL VIRUSES 


The "split personality" of the arteriviruses, a CVL replicase gene coupled to 
a non-CVL set of structural genes, is an interesting test for viral taxonomists. 
Using traditional taxonomic criteria, e.g., the virion and nucleocapsid struc- 
ture, there is no reason to propose a taxonomic relationship between arteri- 
viruses and Coronaviridae. At the same time, certain characteristics, such as 
genome organization, replication strategy, and sequence homologies, provide a 
solid basis for a formal taxonomic status of the CVL superfamily. Any meaning- 
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ful classification of the members of the CVL group clearly requires four hier- 
archical levels (Fig. 6) (Snijder and Horzinek, 1993; Cavanagh et al 1994): the 
coronavirus and torovirus species have now been classified into two genera that 
belong to the Coronaviridae family. The obvious evolutionary link of this 
family to the arteriviruses would be reflected most accurately by promoting the 
present arterivirus genus to the family status and by establishing an order (to 
replace the "superfamily") comprising the Coronaviridae and Arteriviridae 
families. Classification of the arteriviruses as a third genus of the coronavirus 
family is a less attractive alternative because this would not recognize the more 
distant position of the arteriviruses. However, this problem could be circum- 
vented by establishing two subfamilies [Coronaviridae and Arteriviridae) 
(Cavanagh et al 1994) and changing the family name of the Coronaviridae. 
The taxonomic fate of the "CVL superfamily" will depend on the future 
weighing of both old (structural) and new (genetic) criteria for virus classifica- 
tion. The balance between these criteria will decide whether the genetic sim- 
ilarities between coronaviruses and arteriviruses outweigh their structural dis- 
similarities. If, as in other areas of biology, the taxonomy of viruses should have 
a genetic basis and should reflect phylogenetic relationships, the creation of an 
order (Fig. 6) for the Coronaviridae and Arteriviridae would be the most elegant 
solution to recognize both their similarities and differences (Snijder and Horzi- 
nek, 1993; Cavanagh et al., 1994). A taxonomic system that is unable to ac- 
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FIGURE 6. Pedigree of the CVL supertamily. The distances between the various CVL viruses and 
groups are shown arbitrarily. The four hierarchical (taxonomic) levels that can be discerned are 
indicated. Abbreviations: BEV, Berne virus; BRV, Breda virus; EAV, equine arteritis virus; HCV 229E, 
human coronavirus 229E, IBV, infectious bronchitis virus; LDV, lactate dehydrogenase-elevating 
virus; MHV, mouse hepatitis virus; PRRSV, porcine reproductive and respiratory syndrome virus; 
SHFV, simian hemorrhagic fever virus. For simplicity, the coronavirus clusters that include MHV 
and HCV 229E are shown as a single species. 
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commodate the results of virus evolution (mutation, recombination, and selec- 
tion) will have no future. 
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CHAPTER 13 


Pathogenesis and Diseases 

of the Central Nervous System 
Caused by Murine 
Coronaviruses 


SAMUEL DALES AND ROBERT ANDERSON 


I. INTRODUCTION 


This chapter is an account of studies of central nervous system (CNS) diseases 
connected with neurotropic variants of MCV such as J. Howard Muller Virus 
(JHM) and A59 and deals with animal models that may have relevance to an 
understanding of human diseases of putative viral etiology such as multiple 
sclerosis (MS). From the time of (JHMV} isolation from paralyzed mice by 
Cheever et al. (1949) and Bailey et al. (1949), this agent and related strains have 
provided copious data about encephalitic and demyelinating diseases in ro- 
dents. To date, however, any possible connection between murine coronavirus 
(MCV) and MS is tenuous. The reported isolation of coronavirus (CV) particles 
from MS patients’ brain (Burks et al., 1980) or electron microscopic visualiza- 
tion of CV-like particles in brain tissue of one MS patient (Tanaka et al., 1976), 
require confirmation. An older report of JHMV-induced panencephalitis in 
monkeys (Kersting and Pette, 1956), however, has been confirmed by Murray et 
al. (1992a) in their description of demyelinative disease in several monkey 
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species. This finding, coupled with the identification of viral RNA and protein 
within demyelinative plaques in human brain tissue from MS patients (Murray 
et al., 1992b; Stewart et al., 1992), are highly provocative data concerning an 
involvement of CV in the etiology of MS. However, a cautious evaluation of 
these results may be in order, in view of the recent demonstration of genomic 
and antigenic relatedness between the N protein of JHMV and the microtubule- 
associated protein tau (Pasick et al., 1994). 

The disease process associated with CV infection is highly variable, both in 
animals and at the cellular level, depending on the virus, cell types, and host 
species. In view of the voluminous literature on the subject, this relatively brief 
review is concerned primarily with disease processes in rodents infected by 
murine CVs and emphasizes infections of neural cells and pathogenesis in 
the CNS. 


II. VIRUS-CELL INTERACTIONS RELATED TO PATHOGENESIS 
AND DISEASE 


To comprehend the overall disease process elicited by neurotropic MCV, 
one should consider three major parameters. First, it is necessary to consider 
the complex organization of CNS tissue, involving a variety of cell types and 
lineages, that provides a great diversity of cell-cell interactions. Second, one 
must consider the highly error-prone synthesis of CV genomes that can undergo 
a high frequency of recombination, generating rapidly evolving mutants or 
variants (Lee et al., 1991; Lai, 1990). Such variants manifest altered tropism and 
virulence, as detailed in Section IV. Third, one must keep in mind the effects of 
host immunity evoked by CV infections. These responses, primarily of the 
cellular type, but also to some extent humoral, can modulate pathogenesis and 
disease, as described in Section III. 


A. Correlations between Infections in Vivo and with Explanted 
Neurons and Glia 


Infections of different tissues and organs in vivo develop according to the 
MCV type and the route of inoculation employed. The natural means for 
initiating infection is through intranasal (IN) instillation of virus, but introduc- 
tion through intraperitoneal (IP) and intracranial (IC) injections is also com- 
monly employed. Neonates and juveniles usually succumb more rapidly than 
adults, developing fulminant infections manifested by widespread destruction 
of tissues. In the CNS an early onset acute encephalitis develops. 

In adult mice, responses are variable depending on the MCV type and the 
genetic constitution of the host, as discussed in more detail in Section IV. When 
adult mice are challenged intranasally with the highly virulent murine hepa- 
titis virus S (MHV-S), occurrence of a rapidly fatal infection of the CNS is 
associated with generalized virus dissemination to the bowel, liver, spleen, and 
other organs (Taguchi et al., 1979; Barthold et al., 1986; Lavi et al., 1984a). 


MURINE CORONAVIRUSES AND DISEASE 259 


Histopathology in the CNS includes spongiform lesions within the brain stem. 
Comparable IN inoculation of neonates with neurotropic A59 and JHMV also 
results in a wide dissemination into the CNS, respiratory, and vascular endo- 
thelium and elsewhere (Barthold and Smith, 1984). 

By contrast, A59 and JHM, when given IN to adult mice, induce milder 
forms of disease. The viruses progress slowly through the olfactory tracts before 
spreading into the brain and spinal cord (SC) (Barthold et al., 1986). Encephalitis 
or subacute paralytic symptoms frequently ensue (Lavi et al., 1988; Barthold et 
al., 1986]. The viscerotropic MHV3 strain of MCV, when inoculated into resis- 
tant A/JX mice, fails to produce disease symptoms, but nevertheless replicates 
in liver and brain (Tardieu et al., 1986). Evidently MCVs belonging to the 
neurotropic or viscerotropic categories possess a more general tropism in the 
mouse than is indicated by their designation. 

After IN infection of mice, virus dissemination and progress of disease have 
been followed within the CNS and other tissues by means of antibodies and 
nucleic acid probes. The localization of viral antigen and RNA by means of in 
situ hybridization in sections has been highly informative. With C57BL/6 mice, 
in which acute encephalitis develops upon infection with A59 or JHMV, the 
virus becomes widespread within the cortical gray matter (GM|, brain stem, the 
white matter (WM) tracts in the regions of the optic chiasma, and the SC (Lavi et 
al., 1988; Perlman et al., 1988, 1989, 1990). A time-course reconstruction of 
events leads to the conclusion that virus spreads from the trigeminal olfactory 
locus along anatomically and functionally interconnected neuronal tracts. Dis- 
semination of virus along such neuronal pathways provides a rationale for the 
speed at which virus can spread toward and within the SC. Virus expression is 
detectable at the cellular level in neurons and glia, consistent with the histo- 
pathology associated with the acute encephalitic or progressive demyelinative 
diseases caused by A59 and JHM (Perlman et al 1988; Fleming et al., 1987; 
Spaan et al., 1988). In the mouse CNS the neurons and oligodendrocytes (OL) are 
the prominent cell types containing viral RNA and antigen (Lavi et al., 1987), 
although astrocytes (AS) may also become targets in primary explants. In such 
cell cultures from neonatal BALB/c or CD.1 mice, both OL and AS are infectable 
by MCV A59, JHM, and human coronavirus (HCV) OC43 (Wilson et al., 1986; 
van Berlo et al., 1989; Pearson and Mims, 1985). Likewise, explanted neurons 
can be productive host cells for A59 and JHMV (Dubois-Dalq et al., 1982; 
Knobler et al., 1981). 

Although the CNS disease process induced in the rat by MCV has many 
parallels with that observed in mice, notable differences are evident. They 
include an age-related refractoriness to infection, apparent by the time of wean- 
ing (Sorensen et al., 1987a]; a requirement for larger inocula, which are usually 
administered IC to promote efficiency, although neonates can be challenged 
successfully IN (Hirano et al., 1980); and an association of the CNS and para- 
lytic forms of disease with JHMV but not MHV3, although the latter can 
produce inapparent infections (Hirano et al., 1980). Again, the progress of infec- 
tion can be followed in the CNS by antigen and RNA probes, which demon- 
strate that dissemination occurs in a temporal sequence, culminating with 
virus spread within the SC (Sorensen et al., 1980, 1984b]. The cell types promi- 
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nently involved in infection include neurons and OL. In situ hybridization by 
means of cDNA probes has pinpointed the presence of viral RNA in hippocampal 
and cerebellar neurons. The virus may remain sequestered in these targets for 
prolonged periods, even during episodes of subacute, paralytic disease evident 
in the Wistar Furth (WF) rat model (Sorensen and Dales, 1985; Parham et al., 
1986). Rat CNS explants, cultured to promote neuronal survival in vitro, are 
likewise infectable with JHMV. In such cultures, replication occurs preferen- 
tially or exclusively in neurons (Pasick and Dales, 1991). The susceptibility of 
primary neuron explants to JHMV is consistent with them being primary 
targets for the development of acute encephalomyelitis (Knobler et al., 1981; 
Dubois-Dalq et al., 1982; Buchmeier et a1.,1984; Sorensen and Dales, 1985; 
Pasick and Dales, 1991; Matsubara et al., 1991; Zimprich et al., 1991). 

Contrary to the dual tropism of JHMV observed with murine cells, in our 
studies with explanted rat cells, this virus exhibits a reciprocally exclusive 
tropism for OL compared with MHV3, which infects only type 1 AS (Beus- 
hausen and Dales, 1985). This tropism pertains to explants from WF or Wistar 
Lewis (WL) strains. Our findings were confirmed with Wistar rat glial cultures 
(van Berlo et al., 1986}, although JHMV infection of both OL and AS was 
detected with glial cultures from WL rat CNS (van Berlo et al., 1989). In this 
case, the AS lineage (type 1 or 2) was not established. 

The inverse tropism to those that we determined has been described in 
CNS explants from WL rats (Massa et al., 1986). In this study, immunofluores- 
cence labeling indicated selective infection of AS rather than OL, with some 
modulation in tropism (Massa et al., 1988). These diverse findings on the 
tropism of JHMV for rat glia have not been reconciled completely, but could be 
related to the identity of the lineage of infectable AS. The presence of glial 
acidic fibrillary protein (GFAP) as filament bundles that characterize type 1 AS, 
also occur, albeit transiently, in the progenitor cell type O2A, from which both 
OL and type 2 AS differentiate (Raff, 1989; Lillien and Raff, 1990). Since imma- 
ture, rather than fully differentiated, rat OL appear to be susceptible to JHMV 
(Beushausen and Dales, 1985), it is likely, as shown by Pasick and Dales (1991), 
that GFAP+ O2A cells become infected and express virus antigen, accounting 
for the observations of Massa et al. (1986, 1988). 

In general, the tropism defined in vitro matches the observed distribution 
of CV infections of the CNS in vivo, and is also consistent with the patho- 
genesis and disease processes that become evident. 


B. Ligand—Receptor Interactions 


The surface component of CV that attaches to receptors on the host cell is 
the surface glycoprotein. This molecule possesses features generally associated 
with peplomers of enveloped viruses (Schmidt et al., 1987). Activation of the 
fusogenic and penetration activities of S is enhanced by proteolytic cleavage 
into the subunits S, and S, (Schmidt et a1., 1987; Luytjes et al., 1987; Frana et al., 
1985; Sturman et al., 1985; Stauber et al., 1993; Taguchi, 1993). However, attach- 
ment and neutralization by antibodies are not contingent on the proteolytic 
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processing of S (Luytjes et al., 1989; Collins et al 1983). The cell-to-cell spread 
of infection in the human line Medical Research Council (MRC-C) challenged 
with HCV 229E (Appleyard and Tisdale, 1985} or the murine cell line 17 Cl I 
challenged with A59 (Sturman et al., 1985; Frana et al., 1985), requires the 
activation of S by a cellular protease that is inhibited by agents such as leupep- 
tin (Appleyard and Tisdale, 1985). Also, virus dissemination is not absolutely 
contingent on the formation of syncytia, even after S has been processed into S, / 
S,. Thus, MHV A59 is highly fusogenic for 17C1 I and L cells, but in contrast to 
MHV3 and JHMV is nonfusogenic in vivo or for explanted rodent glial cells 
(Lucas et al., 1977; Sturman et al 1985; Frana et al., 1985; Lavi et al., 1987; 
Wilson and Dales, 1988). Differences in syncytiogenic activity among CVs 
remain unexplained. 

The receptor specificity for CV has also been studied as a determinant of 
tropism. In one system using rat glial cultures, one can demonstrate an unam- 
biguous tropism of JHMV for OL and MHV3 for AS, but there is no difference in 
the binding efficiency of labeled inoculum particles of either virus to either cell 
type (Beushausen and Dales, 1985). With continuous cell lines such as the rat 
RN2-2 Schwannoma, which is infectable by JHMV but not by MHV3, both 
virus types likewise become attached equally well (Lucas et al., 1977, 1978). In 
the case of the C6 rat astrocytoma cell line, which is resistant to infection by 
both viruses, receptors for JHMV, MHV3, and other MCV strains are present 
(van Dinter and Flintoff, 1987; Flintoff and van Dinter, 1989; Kooi et al 1988). In 
the case of sublines derived from the prototype L strain murine line, the more 
restrictive subline LM-K can absorb A59 virus as efficiently as the highly 
permissive L2 cell (Kooi et al., 1988). A similar situation is found with cells 
from SJL/J mice, which are highly resistant to JHMV but not to MHV3. Thus, 
both viruses can be adsorbed with the same efficiency to OL and AS of SJL/J 
mice and to OL and AS of permissive BALB/c and CD.1 mice (Wilson et al., 
1986). | 

A presumed absence of receptors for A59 on SJL/J hepatocytes and entero- 
cytes, which was assumed to account for differences in resistance of SJL/J mice 
compared to susceptible BALB/c mice (Boyle et al 1987), also can now be 
explained. The MHV receptor is a 110- 120-kDa G-P, a member of the carcino- 
embryonic antigen family (Williams et al. 1991). Although this G-P is expressed 
on SJL/J cells, its form is modified (Yokomori and Lai, 1992), presumably affect- 
ing the efficiency of JHMV replication. Whatever the deficiency of receptors for 
JHMV and A59 might be in SJL/J mice, the resistance of neurons and glia from 
these mice is due to a defect in cell-cell virus spread rather than an absence of 
receptors per se (Wilson and Dales, 1988; Pasick er al 1992). Virus dissemina- 
tion may involve a deficiency in the processing of S because a more fusogenic 
JHMV variant ATf11, isolated from rat SC (Morris et al., 1989], is not restricted 
in SJL/J glial cells (Pasick et al., 1992). 

The identification of a hemagglutinin esterase (HE) glycoprotein compo- 
nent in BCV (Vlasak et al., 1988) drew attention to the possible function of HE 
during early interactions with host cells. By sequence comparison, HE shows 
evolutionary relatedness with the influenza type C hemagglutinin H, subunit 
(Kienzle et al., 1990]. The esterase of HE has receptor-binding activity and 
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inactivates receptors on erythrocytes by hydrolyzing O-linked acetylsialic acid 
(Schultze et al 1991). Since this reaction and virus entry are both blocked with 
diisopropyl fluorophosphate (Vlasak et al 1988], it is presumed that HE acts in 
a manner similar to that of a receptor-destroying enzyme of type C influenza 
virus (Kienzle et al., 1990). Evidently the esterase is not essential for infectivity 
because it is absent from MCV A59 and some MCV JHM strains (Parker et al., 
1990). Some of the roles suggested for HE are: as a determinant of tropism or 
virulence in the CNS; as an additional ligand for binding of cell targets; or for 
facilitating the penetration of inoculum virus (reviewed in Parker et al., 1990; 
Vlasak et al., 1988). 


C. Penetration 


An electron microscopic examination of the interaction between surfaces 
covering the microvilli in calf intestine inoculated with the calf diarrhea CV 
was interpreted as showing fusion between the virus envelope and plasma 
membrane on microvilli (Doughri et al 1976). Studies on cells in culture, using 
biochemical and electron microscopic methods, suggested that MCV initiates 
an infection following sequestration or viropexis of the inoculum (reviewed in 
Krzystyniak and Dupuy, 1984). The inhibition of early stages of A59 or MHV3 
infection, including the eclipse phase (Mizzen et al., 1985), in L2 cells treated 
with agents such as NH,C] and chloroquine (Krzystyniak and Dupuy, 1984), 
also supports a requirement for engulfment as an obligatory function for initiat- 
ing infection. More recent unpublished data from our laboratories indicate that 
NH,CI only marginally inhibits the early stages of MHV infection, implying 
that penetration and release of the genome occurs in "early" endosomes con- 
taining a neutral or only slightly acidic milieu due to their intracellular location 
close to the cell surface. (Schmid et al., 1989]. 

Studies of penetration by JHM, A59, and MHV3 into continuous neural and 
nonneural cell lines suggest that interactions with the host may vary. The 
virions that are absorbed onto and sequestered by C6 astrocytoma cells do not 
become eclipsed (van Dinter and Flintoff, 1987), due to an unidentified defect 
that can be circumvented by inducing fusion between viral and cell membranes 
by means of polyethylene glycol (PEG). PEG by itself does not enhance the 
infectiousness of permissive cells, among them L, RN2-2 Schwannoma, and 
HTC hepatoma lines. The failure of a “low pH shock” to promote infectability 
of MCV in the manner observed with vesicular stomatitis virus (VSV) and 
A-type influenza viruses also argues against involvement of an acidic endo- 
somal compartment in the penetration sequence (Flintoff and van Dinter, 1989). 
Host control over virus penetration, examined by means of cell-cell hybrids 
between the resistant C6 and either partially or fully permissive cell lines 
showed that C6 x L2 and RN-2 x L2 cell lines could be infected without the use 
of PEG (Flintoff and van Dinter, 1989; Coulter-Mackie et al., 1984). However, 
these permissive cell hybrids retain the phenotype of the parent that resists 
extensive syncytia formation. This demonstrates that penetration and MCV- 
induced cell-cell fusion are controlled independently. 
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D. Uncoating and Initiation of Genome Functions 


Although the ssRNA genome penetrates from inoculum virions into the 
host within a helical coat of N protein (Davies et al 1981), the infectivity of the 
naked genomic RNA (Siddell et al., 1982) implies that, in the normal course of 
penetration, the uncoating step involves dissociation of N from the RNA. The 
nascent N polypeptide of JHMV has a molecular weight of 50 kDa which 
increases to 56 kDa upon phosphorylation. The 56-kDa phosphorylated form of 
N exists in virions (Armstrong et al 1983; Wilbur et al 1986; Siddell et al., 
1981). From sequence data on N (Armstrong et al., 1983; Skinner and Siddell, 
1983) and other evidence (Wilbur et al 1986), one can deduce that N has two to 
four, or perhaps more, potential sites for phosphorylation, a modification ef- 
fected prior to virion assembly by either cellular kinases (PK) or a MCV- 
specified PK. In a converse sequence of events taking place during penetration, 
the molecular weight of N is reduced initially from 56 to 50 kDa, then further 
breakdown into smaller molecular weight polypeptide fragments occurs (Beus- 
hausen et al., 1987). Conceivably, dephosphorylation of N promotes the uncoat- 
ing process, as documented for retroviruses (Beushausen et al 1987; Mohandas 
and Dales, 1991). 

Our demonstration of the existence, in an endosome fraction of a cellular 
phosphoprotein phosphatase [(PPPase) belonging to the neutral type 1 class of 
enzyme, unpublished], with a high specificity for N (Mohandas and Dales, 
1990}, is consistent with the view that uncoating may either commence or 
proceed to completion in “early” endosomes and/or close to the cell surface. In 
this context, it is worth mentioning that endosomal PPPases of a type similar to 
that acting on N have been recognized in clathrin-coated vesicles, where they 
may catalyze the rupturing of polymeric clathrin protein cages and facilitate 
movement of the freed endosomes and fusion of the endosomal membrane with 
other membrane compartments (Pauloin et al., 1988; Greene and Eisenberg, 
1988; Loeb et al., 1989). 


E. Effect on Cellular Protein Synthesis 


Viruses have developed diverse strategies to ensure the translation of virus- 
specific mRNAs in the infected cell. MCV gives rise to mRNAs that share the 
structural characteristics of most eurkaryotic mRNAs, such as the presence of 
5’ terminal caps (Lai et al., 1982] and 3' poly A tails (Cheley et al., 1981; Cheley 
and Anderson, 1981). It appears unlikely, therefore, that translational control in 
MCV infection is mediated by the alteration of host initiation factors such as 
occurs, for example, in poliovirus infection and results in the selective transla- 
tion of structurally distinct (e.g., uncapped) mRNAs (Kaufman et al., 1976; 
Trachsel et al., 1980). 

At high multiplicities of infection, MCV inhibits host protein synthesis in 
certain cell lines. In mouse L2 fibroblasts, infection results in a decline in total 
protein synthesis to about 7% of controls. This overall decrease is likely due 
to a partial block of translation at the initiation stage, as indicated by an 
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increase in free 80S ribosomes. There is also a shift in size from heavier to 
lighter polysomes, reflecting increased competition by a greater number of 
shorter mRNAs for a limited quantity of available ribosomes. The overall total 
poly(A) *RNA content of the infected cell is increased approximately threefold 
by 6 hr postinfection, as a result of the synthesis and accumulation of viral 
mRNA. Most interestingly, by this time, and later in infection, cellular mRNA 
transcripts disappear, apparently due to a degradative mechanism (Hilton et al., 
1986). 

Previously, virus-induced hydrolysis of host mRNA had been noted after 
infection with complex DNA viruses, such as herpes simplex (Nishioka and 
Silverstein, 1978) and vaccinia (Rice and Roberts, 1983], and with influenza, an 
RNA virus. In the case of influenza, breakdown of host mRNA is a consequence 
of the unique mode of transcapping, i.e., the capture of host-derived 5'-caps 
(Bouloy et al., 1980; Plotch et al., 1981). Most other RNA viruses studied to date, 
however, appear to effect translational controls by nondegradative mechanisms 
(Fernandez-Munoz and Darnell, 1976; Gallwitz et al., 1977; Lodish and Porter, 
1981). Evidence has also been presented suggesting that during infections with 
some viruses, including adenovirus (Lazaridis et al., 1988) and La Crosse virus 
(Raju and Kolakofsky, 1988], viral functions are expressed that affect the stabil- 
ity of cellular or viral mRNAs. 

It is intriguing to speculate that the mechanism of RNA degradation con- 
nected with MCV infection (Hilton et al., 1986] is related to the unique replica- 
tion strategy of coronavirus. It has been suggested that the synthesis of CV 
subgenomic mRNAs may involve nucleolytic trimming (Lai, 1986). Although 
this type of nucleolytic activity has yet to be identified, if present, such a MCV- 
induced nuclease could also presumably recognize as a substrate cleavage sites 
in cellular mRNAs, thereby accelerating their hydrolysis by host degradative 
mechanisms. 

Another obvious cell-killing function of MCV infection is the inhibition of 
host translation (e.g., Marvaldi et al., 1977). Differences among host cells in 
their susceptibility to MCV-mediated translational control could be manifested 
in the cytopathology that occurs and also determine whether the outcome is an 
acute or persistent infection. 


F. Effects on Cellular Membranes 


The unique sites of CV assembly by budding from smooth endoplasmic 
reticulum membranes, recognized by electron microscopy (Massalski et al., 
1982; Barthold and Smith, 1984; Pasick et al., 1994; Krijnse-Locker et al., 1994; 
Klumperman et al., 1994], has not yet been explained. Earlier, it was thought 
that the M protein had an important role in organizing the assembly of virion 
envelopes on these internal membranes, mainly because this protein has sig- 
nals for retention in the Golgi (Swift and Machamer, 1991). Recently, however, 
it has become apparent that the Golgi compartment is beyond the site of 
budding (Klumperman et al., 1994). One reason suggested for M being anchored 
and retained on internal membranes is the preferential O-linked glycosylation 
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(Rottier et al., 1984; Niemann and Klenk, 1981; Niemann et al., 1984). In 
contrast, the S protein, which possesses N-linked carbohydrates, behaves like 
the usual peplomer G-P of viruses budding from the cell surface. Thus, although 
S attaches to the formative CV envelopes within the budding compartment, it 
can also migrate independently to the plasma membrane where it accumulates 
and can express fusogenic activity (Rottier et al., 1984; Niemann et al., 1984). 

Electron microscopic examination of MCV-infected cells has revealed ap- 
parent distensions or amplification of intracellular membranes that may be 
derived from smooth endoplasmic reticulum (David-Ferreira and Manaker, 
1965; Massalski et al., 1982; Dubois-Dalq et al., 1982; van Berlo et al., 1986; 
Tooze et al., 1984]. It has been speculated that such membrane proliferation or 
reorganization may reflect a cellular response to the removal of membrane lipid 
as it becomes incorporated into virion envelopes (Tooze et al., 1984). This 
explanation seems to be unlikely since membrane proliferation has been ob- 
served during infection with a temperature-sensitive (ts) mutant of MCV, when 
virion assembly was interrupted (van Berlo et al., 1986). The stimulation of 
membrane proliferation and lipid biosynthesis resulting from cytopathology 
have also been observed following infections with other agents, among them 
nonenveloped polio- and mengovirus (Penman, 1965; Amako and Dales, 1967. 

A very obvious manifestation of virus-cell membrane interactions occur- 
ring within the replication cycle of MCV is membrane fusion. Fusion occurs 
both early during virion entry and uncoating and later when syncytiogenesis is 
evident. The control of cell-cell fusion is closely related to events that contrib- 
ute to the establishment of a cytolytic versus an attenuated or persistent 
infection (Kooi et al., 1988; Mizzen et al., 1983, 1985, 1987b). 

The S proteint is a key mediator for both cell fusion and pathogenesis. 
Within the infected CNS, differences among cell types have been noted in 
relation to virus-induced cell fusion (Sorensen et al., 1980; Dubois-Dalcq et al., 
1982; Buchmeier et al., 1984). By inducing cell-cell fusion from within, MCV 
may facilitate both dissemination of virus and maintenance of persistent infec- 
tions, as demonstrated in the presence of neutralizing antibodies added to the 
medium (Buchmeier et al., 1984; Sorensen et al., 1984a; Stohlman and Weiner, 
1978). In cultures, such as the Cl-1 line, which is relatively resistant to JHMV- 
mediated syncytiogenesis, individual cells may survive a cytolytic, acute infec- 
tion, thereby maintaining a state of virus persistence (Mizzen et al., 1983). 

The resistance of murine cells to various strains of MCV appears to be 
under the control of a single autosomal, recessive gene (Bang and Warwick, 
1960; Virelizier and Allison, 1976; Stohlman and Frelinger, 1978). Concomi- 
tantly, the complete or partial genetic resistance to disease observed with some 
strains of mice is inherent in explanted macrophages, hepatocytes, fibroblasts, 
and other cell types. The infection of cells from such mice results in either an 
abortive (Virelizier and Allison, 1976) or limited infection, with reduced cyto- 
pathology (MacNaughton and Patterson, 1980; Arnheiter et al., 1982; Lamon- 
tagne and Dupuy, 19842). Although cultured macrophages, OL and AS from SJL/ 
J mice, which are resistant to JHMV as primary targets of infection, can support 
MCV replication, the cell-cell spread of virus due to a fusion-dependent mecha- 
nism is restricted (Knobler et al., 1984). Some deficiency of an SJL/J cell protease 
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was suggested as an explanation for the absence of cell-cell spread of virus. In 
fact, studies on the spontaneous acquisition of resistance by L2 cell mutants, 
selected for the survival of a usually lethal MCV infection, demonstrate that 
partial resistance to infection and to S-mediated syncytia formation are comu- 
table characteristics of the host cell (Daya er al., 1989). Homeostatic control of 
fatty acid and cholesterol metabolism has been shown to be another important 
regulator of virus-mediated cell fusion both in vivo and in vitro (Cervin and 
Anderson, 1991]. 

It has been assumed that differences in the infectability and susceptibility 
of cells to MCV-mediated fusion are correlated (Mizzen et al 1983, 1987b), and 
depend on proteolytic activation of S, presumably involving a host cell protease 
(Frana et al., 1985). Quantitative cleavage of S is, however, not necessary for 
cell-cell fusion, and there is no strict correlation between the proteolysis of S 
and the extent of cell fusion (Frana et al., 1985], as indicated when S was 
mutated in the cleavage signal and expressed independently by vaccinia recom- 
binants (Stauber et al., 1993; Taguchi, 1993). Also, one class of fusion (from 
within) defective MCV mutant, which possesses a single mutation within the 
cleavage signal of S, has been isolated from persistently infected glial cultures 
(Gombold er al., 1993). It is possible that uncleaved S protein is fusogenic, but 
that fusogenicity is enhanced upon proteolytic cleavage. 

CV-induced membrane fusion and the severity of infection are also influ- 
enced by host cell membrane lipid composition (Daya et al., 1988), particularly 
the cholesterol content (Daya et al., 1988). Conceivably there may be a specific 
interaction between peptide sequences of the S protein and cholesterol, in a 
manner similar to that observed with Sendai virus F protein (Asano and Asano, 
1988). Another modulation of MCV infection by cholesterol may pertain to the 
inhibition of Kupffer cell function during in vivo infection (Pereira et al., 1987). 
The observed association of the MCV N protein with cellular membranes may 
be another example of lipid-dependent events during replication. N is synthe- 
sized on free polysomes but associates posttranslationally with membranes, 
possibly as a result of its phosphorylation (Stohlman et al., 1983). Since N binds 
the viral M protein (Sturman et al., 1980) and the M protein is found in associa- 
tion with smooth endoplasmic reticulum (ER) membranes (Rottier and Rose, 
1987), the membrane attachment may be mediated by the M. Nevertheless, N 
protein is able to associate with cellular membranes in the absence of M 
(Anderson and Wong, 1993), raising the possibility that the components in- 
volved are certain membrane phospholipids (Anderson and Wong, 1993). Inter- 
estingly, the ability of N to bind both membrane phospholipids and nucleic acid 
is perhaps analogous to that shown by certain bacterial DNA-binding proteins 
such as DnaA (Sekimizu and Kornberg, 1988) and RecA (Krishna and van de 
Sande, 1990). The possibility, therefore, arises that an association of N with 
viral RNA is influenced by competitive binding to membrane phospholipids. 

The involvement of the cytoskeleton in a number of MCV infections has 
also been investigated. Treatment of L2 fibroblasts with colchicine or cyto- 
chalasin B, drugs that disorganize microtubules and actin filaments, alters cell 
shape without affecting syncytiogenesis or the acute cytopathology (Daya et al., 
1988). By contrast, in peritoneal macrophages exposed to cytochalasin B, the 
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infection is changed from an acute to a persistent type (Mallucci and Edwards, 
1982). Quite possibly, the cytoskeleton plays a role in regulating MCV assembly 
and cytopathology, involving membranes that may vary in a cell-dependent 
manner according to differences in the organization of the cytoskeleton. The 
close association of N of JHMV with microtubules in neurites of cultured 
neurons, recently recognized by Pasick et al (1994), was explained in terms of a 
specific binding by N, via a region of the protein that has a close sequence 
homology to the microtubule-binding site of the cytoskeletal protein tau. 

As mentioned above, the presence of S at the cell surface is related to cell 
fusion (Collins et al., 1982). However, it may also affect control of plasma 
membrane permeability, as evidenced by an influx of sodium ions (Mizzen et 
al., 1987a). This occurs during both acute and persistent infections (MacIntyre 
et al., 1989). However, in contrast to certain other virus infections (Carrasco and 
Smith, 1976), there is evidence that in MCV-infected cells an elevated intra- 
cellular sodium ion concentration affects the regulation of translational control 
(Mizzen et al., 1987a). One should not exclude the possibility that altered 
membrane permeability can affect other aspects of viral cytopathology. 


G. Factors Related to Latency and Persistence 


Infection of the CNS can establish a very prolonged state of CV latency or 
persistence. Whether the animal remains asymptomatic or undergoes a late- 
onset progressive paralytic disease with demyelination, virus expression in the 
form of RNA, antigen, or infectious particles may continue for prolonged pe- 
riods in mice (Pickel et al., 1981; Lavi et al., 1984a,b, 1987; Perlman et al., 1988; 
Knobler et al., 1981) or rats (Wege et al., 1983, 1984; Sorensen et al., 1984b; 
Sorensen and Dales, 1985). Persistent infections are readily established in ex- 
planted primary neural cells (Dubois-Dalq et al., 1982; Beushausen and Dales, 
1985; Wilson et al., 1986; Pearson and Mims, 1985; van Berlo et al., 1989; Lavi et 
al., 1987) and continuous cell lines of neural and nonneural derivation (Flintoff 
and van Dinter, 1989; van Dinter and Flintoff, 1987; Coulter-Mackie et al., 1984, 
1985; Lucas et al., 1977; 1978; Collins and Sorensen, 1986; Leibowitz et al., 
1984). Evidently, conditions prevail in the CNS of postweanling and adult 
rodents, as well as in neural cells in culture, which favor maintenance of the 
state of persistence. Analogous observations have been documented for other 
neurotropic RNA viruses such as measles virus (Miller and Carrigan, 1982; 
Yoshikawa and Yamanouchi, 1984), canine distemper virus (Pearce-Kelling et 
al., 1990; Zurbriggen et al., 1987), rubella virus (van Alstyne and Paty, 1983), 
picorna viruses (Rodriguez et al., 1990), and alpha viruses (Bruce et al., 1984). A 
common feature linking MCV with all of the above agents is an initial low rate 
of replication associated with limited cytopathology. Following a period of 
adaptation, during which, perhaps, variants are selected and maintained, the 
infectious process may remain unchanged or switch into a more acute or more 
attenuated mode (Taguchi and Siddell, 1985) or replication may be entirely 
suppressed (Collins and Sorensen, 1986). The exceptionally high efficiency of 
MCV genetic recombination creates numerous variants (Lai et al., 1985; Ma- 
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kino and Lai, 1989), a phenomenon that can account, at least partially, for the 
varying types of infection encountered, as discussed in Section IV. 

The repeated passage of JHMV in murine fibroblastic DBT cell lines gener- 
ates defective-interfering (DI) particles in which genomes are shortened by 
deletions (Makino et al., 1984, 1985). In a high multiplicity of infection, Dis 
interfere with the transcription of all mRNAs species with the exception of 
mRNA], coding for N. There is no substantial evidence linking Dis with CV 
infections of the CNS or neural cell explants (Taguchi and Siddell, 1985). 

A regular feature of MCV persistence in continuous cell lines, not exclu- 
sively of the neural type, is a wavelike, cyclical production of infectious parti- 
cles (Lucas et al., 1977, 1978; Coulter-Mackie et al., 1984, 1985). Such peri- 
odicity in virus titer also has been reported for measles and rabies viruses 
(reviewed in Lucas et al., 1978) but is not evident during MCV infections of 
primary neural cell cultures (Beushausen and Dales, 1985; Wilson et al., 1986). 
With rabies virus, the rise and fall of titers was explained as due to reciprocal 
periodic development of an antiviral state related to the autocrine activity of 
interferon (IFN) (reviewed in Coulter-Mackie et al., 1985). During persistent 
infection of porcine alveolar macrophages by transmissible gastroenteritis virus 
(TGEV) of pigs, an induction of a-IFN was observed (Laude et al., 1984). By 
contrast, our extensive, detailed search for the induction of IFN during a cycli- 
cal rise and fall of JHMV titers from RN2-2 Schwannoma cells failed to detect 
the presence of this lymphokine, although these host cells have the capacity to 
acquire an antiviral state and make IFN when appropriately induced with 
reovirus (Coulter-Mackie et al., 1985). Likewise, persistent infections of mouse 
neuroblastoma C1300 cells with JHMV (Leibowitz et al., 1984) and human 
glioblastoma U-87 HG cells infected with HCV OC43 (Collins and Sorensen, 
1986) do not evoke the production of IFN, demonstrating that in these systems 
IFN or the antiviral state are not required to maintain persistence. The effec- 
tiveness of IFN as a protective factor may also depend on the MCV strain 
involved. Thus, with DBT cells, which become productively infected, a-IFN 
fails to arrest replication or the development of cytopathology when the highly 
virulent MHV-2 is used, whereas IFN treatment can abrogate both virus produc- 
tion and cell killing when MHV-S or JHMV are used as infectious agents 
(Taguchi and Siddell, 1985). 

Another reversible cellular response to MCV infections of primary neural 
cell explants and continuous cell lines is the suppression of persistent virus 
production that occurs on a shift up of the ambient temperature from a permis- 
sive 32-33?C to the restrictive 39-40?C (Lucas et al., 1977, 1978; Coulter- 
Mackie et al., 1984, 1985; Sorensen et al., 1984a). The MHV3 and JHMV progeny 
generated during permissive infection of L2 cells do not acquire or possess a 
thermosensitive or thermolabile phenotype during passage through lympho- 
cytic (Lamontagne and Dupuy, 1984b) or neural cells (Lucas et al., 1978; 
Leibowitz et al., 1984). Similar results were obtained with measles virus (Lucas 
et al., 1978). This implies that latency at elevated temperature is due to some 
controlling factor(s) in the host cells per se that is not IFN (Coulter-Mackie et 
al., 1985; Leibowitz et al., 1984). By contrast, replication of HCV OC43 in the 
UHS87 HG line generates thermosensitive variants, so that during restriction at 
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high temperature persistent or latent OC43 virus is rapidly eliminated and cells 
are "cured" (Collins and Sorensen, 1986). 

A detailed study of latency was undertaken using the RN2-2 cells infected 
with JHMV (Coulter-Mackie et al 1985). The virus formed during persistent 
infections at 33 °C is not temperature sensitive. Within 24 hr after shift up to 
39 °C, replication is arrested, but the latent genomes are maintained in the host 
cell for prolonged periods and gene expression, measured as translation into N 
protein, can be detected for several days. If the latently infected cells, which 
continue to grow and divide at 39?C, are returned to the permissive tempera- 
ture following 2 to 3 weeks, virus production slowly resumes. Analyses made 
by following a time-course in the decline of infectious centers yielded data 
compatible with a probability model that predicts that due to continual segrega- 
tion of the limited number of latent J[HMV genomes among daughter cells, the 
virus genome is eventually lost by a dilution effect. 


II. RELATIONSHIP BETWEEN DEVELOPMENT AND 
DIFFERENTIATION OF THE CENTRAL NERVOUS SYSTEM 
AND THE DISEASE PROCESS 


A. Characteristics of the Cells Involved 


Under normal circumstances, the suceptibility of rats to CV-induced dis- 
ease extends only to the time of weaning (Sorensen et al., 1980), whereas mice 
remain susceptible to challenge into adulthood. The idea that the tropism of CV 
for specific cells in the CNS can explain the basis of encephalitic and de- 
myelinative paralytic cells in the CNS has already been considered in Section II. 
In the rat model, a temporal coincidence between the completion of myelina- 
tion and the onset of resistance to JHMV drew our attention to OL maturation 
as a possible determinant of resistance to JHMV-induced demyelinating disease 
(Beushausen and Dales, 1985; Wilson et al., 1986; Beushausen et al., 1987). Also, 
the availability of methods for reproducibly culturing cells explanted from the 
CNS now makes it possible to follow the process of differentiation and infec- 
tion among defined types of neural cells. 

Soon after explantation from the brain, a mixture of neurons, cells of glial 
lineage, and other elements are present in culture. The composition of the 
nutrient media, including a serum supplement, can profoundly affect the cell 
types that remain during prolonged cultivation and the cell type selection 
(Fabre et al., 1985; Gard and Pfeiffer, 1989; Muraoka and Takahashi, 1989; Koper 
et al., 1984). Thus, with respect to glial cultures, type 1 AS, identified by the 
presence of GFAP, and the O2A bipotential OL and type 2 AS progenitor cells, 
which have receptors for monoclonal antibody (MAb) A2B5 (and hence are 
designated as A2B5*) (Raff, 1989), are the most numerous elements when neo- 
natal cortex is explanted into nutrient medium with added serum (Beushausen 
and Dales, 1985; Koper et al., 1984). Moreover, the progression of O2A cells into 
AS or OL lineages, respectively, can be reconstructed by following the progeny 
of individual cells in rat optic nerve cultures (Raff, 1989). These elegant analyses 
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demonstrated that, even after their separation from the CNS, glial cells are 
endowed with a predetermined developmental "time-clock" that probably sim- 
ulates the program existing in vivo. The aquisition of GFAP by daughters of 
O2A parents identifies them as type 2 AS. Daughters of O2A expressing myelin 
basic protein (MBP), CNPase activity, and GalC on their surfaces are destined to 
become OL. 

The appearance of the mature OL phenotype is more rapid if cells are 
sparsely distributed in culture dishes (Aloisi et al., 1988) or are grown on poly L 
lysine, which promotes tight adherence to the substrate. Factors promoting 
differentiation favor the onset of resistance to JHMV infection (Beushausen and 
Dales, 1985). Conversion from the O2A into 2AS or OL types is also influenced 
by growth factors such as basic fibroblastic growth factor (bFGF) and platelet- 
derived growth factor (PDGF) (McKinnon et al., 1990; Bógler et al., 1990). For 
example, PDGF, secreted by type 1 AS, promotes the appearance of OL (Raff, 
1989; Levine, 1989). When recently established cultures are switched from a 
serum containing medium to a defined, serumless medium, the survival of 
neurons is promoted and an alteration of A2B5* cells into differentiated OL is 
enhanced (Raff, 1989; Levine 1989; Koper et al., 1984). This suggests that there 
are factors in serum that help to maintain O2A cells but may inhibit the 
survival of neurons. Interactions between neurons and OL are indicated by the 
myelination of neuronal processes in vitro, albeit of a rudimentary type (Mura- 
oka and Takahashi, 1989; Pasick and Dales, 1991). Myelin elaboration in cul- 
tures of OL also has been reported in the absence of neurons, implying that 
axons are not an essential stimulus for triggering myelin formation (Fabre et al., 
1985) 


B. The Infectious Process in Differentiating OL 


If kept for several days, cultures of mixed rat glial cell types contain tightly 
adherent type 1 AS on the bottom and more loosely attached O2A and OL above 
the subjacent AS layer. The mixed glia culture can be infected persistently with 
JHMV and MHV3. After shaking off the top cell layer, type 1 AS are infectable 
exclusively by MHV3. After replanting, the shaken cells can support only 
JHMV replication (Beushausen and Dales, 1985; Wilson et al., 1986). Shaken 
cells, when kept in culture for 2 to 3 weeks, become resistant to JHMV and 
switch from A2B5* into the mature OL phenotype. Shaken cells can be induced 
to differentiate more rapidly and resist JHMV infection by pretreatment with 
N,,O,-dibutyryl 3':5' cAMP (dbcAMP) (see Table I). By contrast, AS treated 
with this metabolite continue to produce abundant quantities of MHV3. Cul- 
tured neurons exposed in the same manner to dbcAMP can be infected persis- 
tently and produce high titers of JHMV (Pasick and Dales, 1991). With other 
neurotropic viruses quite different effects of dbcAMP are observed. Infection of 
AS with rubella virus becomes latent, but it is induced to a productive infection 
by dbcAMP (van Alstyne and Paty, 1983). In contrast, infection of neuronal and 
other cell lines with measles virus is reversibly inhibited by dbcAMP, possibly 
at a late stage in virus expression (Miller and Carrigan, 1982; Yoshikawa and 
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TABLE I. Effect of dbcAMP on the Replication 
of JHMV in Primary Oligodendrocytes? 


Days postinfection 


Treatment of culture 2 3 4 6 8 12 14 
Control 13 46.8 55 38.8 720 31 20 
] mM postinfection 33.4 100° 15.5 100 500 26 30 
1 mM 48 hr before infection 0 0 0 0 0 0 0 
1 mM 48 hr before infection 0 0c 0 0 0 0 0 


NOTE: All titers expressed x 10? PFU/ml multiplicity of infection 0.5-1.0/cell in each case. 
?Adapted from Beushausen and Dales (1985). 

bdbcAMP added. 

dbcAMP removed. 


Yamanouchi, 1984). These observations indicate that dbcAMP effects may be 
different for each type of neural cell and virus tested, even among MCV strains. 
The resistance to JHMV in cells belonging to the OL lineage can be affected 
by metabolites other than dbcAMP, as long as they can act to elevate intracellu- 
lar concentrations of cAMP (Beushausen et al 1987) (Fig. 1). These data indicate 
that adenylate cyclase metabolism is generally involved, perhaps through the 
cAMP-dependent protein kinase[s) (PK). Oligodendrocytes, like brain tissue 
generally, constitutively possess the PK II activity but lack PK I. A striking 
effect of induced OL differentiation is a tenfold increase in the amount of PK I 
regulatory subunit RI despite the absence of constitutive PK I enzyme from OL 
(Beushausen et al 1987). Infection by MCV is arrested in mature OL at some 
stage after internalization of the inoculum but prior to the initiation of genome 
expression. The critical step affected by dbcAMP treatment, or by the normal 
differentiation process, could be the uncoating step, defined here as the separa- 
tion of genome RNA from a phosphorylated form of N constituting the helical 
capsid (Beushausen et al., 1987). Evidence has been obtained showing that 
dephosphorylation of N is followed by its proteolytic breakdown. This step, 
which likely occurs after attachment and uptake, as illustrated in Fig. 2, may be 
catalyzed by a neutral PPPase concentrated in endosomes (Beushausen et al., 
1987; Mohandas and Dales, 1991). A PPPase that is highly specific for N occurs 
in rat brain tissue, OL ROC-1 cells (a hybrid cell line created from OL x C6}, and 
in the highly permissive L2 cell (Mohandas and Dales, 1990). It is remarkable 
that the N-specific, endosomal PPPase activity is inhibited by the addition of RI 
to the reaction, providing evidence for a putative mechanism involved in block- 
ing JHMV infection in OL (Wilson et al 1990). As might be expected from these 
findings, the block to the process of MCV uncoating, relating to the de- 
phosphorylation of N in mature OL, can be circumvented by transfection with 
genomic JHMV RNA (K. Kalicharran and S. Dales, unpublished data). 


C. Immunity as a Factor Regulating Pathogenesis and Disease 


Both cellular and humoral components of immunity modulate the disease 
process due to CNS infections by MCV. Protection against acute, lethal enceph- 
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FIGURE 1. Modulation of R I and R II in primary rat astrocytes and oligodendrocytes during 
treatment with 1 mM dbcAMP. The concentrations of the regulatory subunits in cytosol (100,000 x 
g) fractions from astrocytes and oligodendrocytes were determined by binding of 8-azido-|?2P]- 
cAMP. A densitometer tracing (A) made from an autoradiogram (B), obtained after 10% sodium 
dodecyl sulfate-polyacrylamide gel electrophoresis, enabled a comparison of the time-related 
changes in the R I regulatory subunit. Absorbance units have been normalized to the band of 
greatest density (oligodendrocytes, 5 days posttreatment). (From Beushausen et al., 1987.) 
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FIGURE 2. Diagrammatic representation of the early events connected with MCV uncoating. 
Dephosphorylation of NC protein is proposed as a necessary event for initiation of processing of NC 
to facilitate conformational relaxation of the genome and/or release of NC from the genome to 
allow the expression of subsequent virus-specified functions. 


alitis is achieved by passive immunization with MAb against viral envelope 
glycoproteins S, M, and HE. Such immunization, however, fails to arrest the 
delayed, demyelinating form of disease and may in fact promote it (Fleming et 
al., 1989). Our own studies with rat tetencephalic cultures suggest that the 
subacute disease with demyelination may occur only after survival of the acute, 
encephalitic phase of disease (Pasick and Dales, 1991). An antibody-mediated 
shift in tropism is indicated by the interference with neuronal, but not OL 
infections, when MAb against S or HE is employed (Buchmeier et al., 1984; 
Yokomori et al., 1992). This implies that complement-mediated cytolysis or 
antibody-dependent, cell-mediated cytotoxicity (ADCT) may eliminate in- 
fected targets following the mobilization and infiltration of mononuclear cells. 
In addition, an increase of B and plasma cells as well as IgG+ complement 
deposits within demyelinating lesions of WL rats (Zimprich et al., 1991) is 
consistent with the possible role of ADCT. 

During acute phases of murine CNS infection by JHMY, the T-cell response 
may be crucial in mediating clearance of virus and arrest of the disease pro- 
cess. Upon adoptive transfer of CD4*, Lyt2-cloned T cells specific for JHMV, 
C57BL/6 mice are protected from developing lethal encephalitis by an major 
histocompatability complex II (MHC-II)-restricted delayed type hypersen- 
sitivity (DTH) response (Stohlman et al., 1986, 1988), without a reduction in 
virus titer in the CNS. Such protected mice subsequently manifest OL de- 
myelinating disease. The CNS concentrations of anti-JHMV antibodies are not 
altered in the above T-cell recipients or in WF rats with demyelinating lesions 
(Sorensen et al., 1984b). It can be concluded from these data that the local 
synthesis of antiviral antibodies within the CNS does not prevent demyelina- 
tion. The adoptive transfer of T-cell clones specific for JHMV can also protect 
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against encephalitis by reducing virus replication in neurons (Stohlman et al., 
1986). Furthermore, some data suggest that the clearance of JHMV from the 
CNS of mice appears to involve both CD4* and CD8* T cells (Sussmann et al., 
1989, Williamson and Stohlman, 1990; Yamaguchi et al., 1991). These results are 
in line with the evocation of a CTL response against the carboxy-terminal end 
of N (Stohlman et al., 1992). Another study, however, failed to demonstrate 
cytotoxic lymphocytes (CTLs] against viral components in C57BL/6 mice, 
although other T-helper cell responses were detected (Mobley et al., 1992). 
Complementary studies employing athymic, nude mice and rats or animals 
treated with immunosuppressing agents confirmed the involvement of cellular 
immunity in the disease process. They showed that, relative to control animals, 
gray matter and neuronal involvement is enhanced during T-cell deficiency, 
resulting in acute encephalitic rather than demyelinative, white matter disease 
(Sorensen et al., 1982, 1987b; Zimmer and Dales, 1989; Pasick et al., 1992). 

Postweanling rats challenged with JHMV do not exhibit any disease symp- 
toms, but on histological examination show signs of limited CNS histopathol- 
ogy and some virus expression. The age-limited resistance does not apply to 
athymic nude (nu/nu) rats or animals treated with the immunosuppressants 
cyclophophosphamide and cyclosporin A (Sorensen et al., 1987b; Zimmer and 
Dales, 1989). Rats challenged with virus even beyond 70 days of age develop a 
late-onset encephalitic form of disease that is rapidly fatal. Histopathology 
reveals wide dissemination of infection in neuron-rich gray matter but a limited 
involvement of white matter, where small foci of demyelination develop during 
the late-onset disease. In adult mice of strains that are normally resistant to 
A59, MHV3, and JHMV MCV infections, when initiated by the IN and IP 
routes, immunosuppression promotes a similar, late-onset encephalitis arising 
from extensive neuronal damage (Sorensen et al., 1982, 1984a,b]. It appears that 
CNS neurons in older rats and mice are fully susceptible to infection but 
replication and dissemination of MCV progeny is suppressed when cellular 
immune responses are activated. Presumably, the paucity of susceptible OL 
targets for MCV infection within the CNS of mature animals results in a 
limitation of pathogenesis, manifested as very small WM demyelinating le- 
sions (Sorensen et al., 1987b; Zimmer and Dales, 1989). 

Consistent with the in vivo situation, the susceptibility of neurons can also 
be demonstrated with cortical explants from rat embryos that are maintained in 
long-term culture. Unlike explanted OL, which are intrinsically programmed 
to differentiate within 3 weeks after birth or at an accelerated rate by treatment 
with dbcAMP (Raff, 1989; Beushausen and Dales, 1985], isolated neurons re- 
main susceptible to persistent JHMV infection for long periods of time and can 
produce large quantities of virus after pretreatment with dbcAMP (Pasick and 
Dales, 1991). The late-onset, fulminant infection of neurons in immunosup- 
pressed rats might commence from a reservoir of virus at a foci of latency, 
perhaps in neurons of the cerebellum and hippocampus (Sorensen and Dales, 
1985). 

Unlike cellular immunity, the humoral immune responses that occur 
within the CNS compartment following infection of WF and WL rats fail to 
suppress virus replication or interrupt the development of disease (Sorensen et 
al., 1984b). In Brown Norway rats, a vigorous humoral immunity within the 
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CNS develops after JHMV infection, which could explain why there is a sup- 
pression of disease and virus replication in this model (Dórries et al., 1987). 

The infection of macrophages and lymphotropism of MHV3 during infec- 
tion of partially susceptible (C57BL/6 x A/J)F, mice initially causes a low-grade 
infection of the liver, then of the brain. Upon spreading to the CNS, the infec- 
tion produces paralytic symptoms, sometimes lasting for periods as long as 1 yr. 
Lymphotropism is associated with a progressive decline of immunoglobulin 
production during the initial 3 months, then a return to normal levels (Leray et 
al., 1982). After introduction of JHMV by IN inoculation, adult BALB/c mice 
become protected against challenge for 12 months. This is due to antibodies 
raised against the initial inoculation (Barthold and Smith, 1989). The virus may 
incite an autoimmune disease process in WL rats and SJL/J mice, which follows 
myelin destruction, as described in Section IV. Autoimmunity was postulated 
to involve expression of the MHC-I associated (Ia) determinant on AS when 
these cells become differentiated into antigen-presenting cells due to the pres- 
ence of JHMV (Massa and ter Meulen, 1987; Calder et al 1988). This phenome- 
non does not appear to pertain in mice, in which only class I H-2 antigen 
presentation is observed in AS and OL (Suzumura et al., 1986, 1988). The 
presentation of class I H-2 required continuous presence or formation of viable 
A-59 virus and is mediated by factors produced in glial cells, probably AS (Lavi 
et al., 1989). 


IV. GENETIC VARIABILITY OF VIRUS AND HOST AS 
DETERMINANTS OF PATHOGENESIS AND DISEASE 


Differences in the cytopathology and disease process in animals are greatly 
affected by the genotypes of both the virus and the host. During interactions 
with cell targets, the S peplomer is most probably the major determinant of 
tropism, virulence or attenuation, and cytopathology. The host controls the 
disease process and susceptibility or resistance to infection at the cellular and 
organismic level, including responses by the immune system. 


A. Relationship of the CV Replication Strategy to Genetic Variability 


An understanding of the CV replication strategy can explain why these 
agents have the capacity to evolve rapidly into diverse genetic variants. Thus, 
transcription by means of discontinuous, leader-primed synthesis (Lai, 1986], as 
well as the occurence of subgenomic replicative intermediates (Sethna et al., 
1989; Sawicki and Sawicki, 1990], provide possible mechanisms for highly 
efficient recombination among the nonsegmented CV genomes (Lai et al., 
1985). Based on the currently available CV genomic and mRNA sequences, it is 
possible to explain the genesis of diversity observed among MCV progeny 
following passage through cells in culture or during CNS infection of animals. 
Also, in the case of HCV, circulating among human populations, genetic vari- 
ability could arise by repeated OC43 and 229E reinfections of the upper respira- 
tory tract (MacNaughton, 1982; Reed, 1984). 
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The occasional association of OC43 infections of humans with neurologi- 
cal symptoms (Hellevi and Hovi, 1980) and the induction of a demyelinating 
disease process by MCV in rodents and monkeys have a bearing on the search 
for an etiologic infectious agent in multiple sclerosis (MS). However, the claim 
that two CV isolates, designated SK and SD, had originated from the tissue of 
MS patients (Mendelman ez al., 1983) remains questionable. Sensitive antigen 
detection (Hasony and MacNaughton, 1982) and nucleic acid probes (Weiss, 
1983; Weiss and Leibowitz, 1983) that are able to discriminate between HCV 
and MCV have grouped SK and SD viruses with MCV (Weiss, 1983). 


B. Role of S in Pathogenesis and Disease 


The S peplomer, which is involved in attachment, penetration, syncytio- 
genesis, and other aspects of cytopathology, appears to be one of the most 
variable component of coronavirions. In addition to molecular diversity in S 
from the various CV strains infecting mammals, variability in the MCV S 
protein arises during passage through rodents, generating antigenic and patho- 
genicity variants, as summarized in Table II. An important location for molecu- 
lar modulation could be a hypervariable RNA sequence indentified in the S 
gene (Parker et al., 1989). During replication, this region may either be a pre- 
ferred crossing over site or a site of strong selection, thereby giving rise to 
recombinants of S in which modifications, including deletions, are introduced 
frequently. 

The unquestionable role of S in cytopathogenicity and neuroattenuation 
was demonstrated when S was expressed by means of a vaccinia virus recombi- 
nant vector. This experiment of Daya et al. (1989) demonstrated unequivocally 
that S, by itself, can induce the same cytopathology as the MCV from which it 
was derived. Also, it has been possible to identify separate regions on S associ- 
ated with pathogenicity and other functions (Table II). These various biological 
activities of S were mapped using panels of MAbs able to affect individual 
functions of this G-P. Thus, three topologically distinct antigenic sites, eliciting 
protective cellular or humoral responses in mice, were identified on the S 
protein of JHMV by Talbot et al. (1984). The survival of mice was associated 
with decreased virus production and confinement of the infection to neurons, 
sparing the glia (Buchmeier et al., 1984). 

Another comparative study on the MCV strains MHV-1, A -5, JHM, and 
A59 demonstrated the presence of five antigenic sites, A through E, among 
which B is involved with neurotropism and C with virulence (Talbot and 
Buchmeier, 1985). Such analyses also uncovered polymorphism in the S protein 
of JHMV variants isolated after escaping neutralization by MAb (Dalziel et al., 
1986; Buchmeier et al, 1988). Some isolates obtained in this manner were 
attenuated and others more virulent when compared with the parental JHMV. 
Attenuation was manifested through a requirement for larger inocula to initiate 
CNS disease and a change in preferential tropism from neurons to OL. Thus, a 
paralytic, demyelinating disease rather than an acute encephalitis was elicited 
(Buchmeier et al., 1988). Controlled dual infection with JHMV and A59 has also 
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generated recombinants in the S gene. Epitope mapping carried out with MAb 
demonstrated that the critical determinant(s) for neuropathogenicity occur at 
the C terminal one third of the S protein (Makino et al., 1987; Gallagher et al., 
1990). In some variants, attenuation is linked with a ts phenotype of the virus, 
as in the case of one A59 isolate that replicated at a normal rate in Sac[—) cells 
but behaved as ts in glial cultures and was mildly neuropathogenic in mice [van 
Berlo et al., 1986; Koolen et al., 1983). From this and other findings, it becomes 
evident that a temperature restriction regulating CV infection can be exerted by 
the neural cell host and is not due to thermosensitivity of the virus, as discussed 
in Section II.G (Lucas et al., 1978; van Berlo et al., 1986; Koolen et al., 1983). 

It is possible that some phenotypic variations of S are due to point muta- 
tions alone, but itis known that during passage through neural cells in explants 
or within the CNS, the S protein of the emerging stable variants undergoes 
considerable changes, as described in Table II. One JHMV isolate, passaged in 
C1300 neuroblastoma, possessed a deletion in S and coincidentally became 
nonsyncytiogenic (Lavi et al., 1990). In another JHMV isolate from the CNS that 
could replicate more efficiently in AS, the mRNA encoding S was larger (Tagu- 
chi et al., 1986). In another study of six JHMV isolates from mouse CNS, four 
were shown to possess larger and two smaller than normal length S genes. In the 
smaller S variants, sequence deletions were linked with an attenuated pheno- 
type producing a subacute demyelinating disease in mice (Taguchi and Fleming, 
1989). 

Similar modulations in S of JHMV were generated by passage through the 
CNS of rats. One report described a stable change to a larger S polypeptide 
(Taguchi et al 1985). As described in Table II, genetically stable JHMV variants 
have been isolated and different phenotypes for the induction of neurological 
disease in the white and gray matter of the CNS can be recognized (Baybutt et 
al., 1984; Morris et al., 1989]. One such attenuated isolate from spinal cord (SC), 
termed ATIIf, has a deletion in S, is highly fusogenic for cells in culture, and 
rather specific for inducing SC demyelination (Morris et al., 1989). AT11f over- 
comes the resistance of explanted glial cells in adult SJL/J mice, apparently, 
unlike infection with parental JHMV, because dissemination of the AT11f prog- 
eny is not as restricted and high titers are produced (Pasick et al 1992). Thus, an 
alteration in S has made this isolate more, not less, virulent for mouse neural 
cells. A double mutant attenuated in both encephalitic and demyelinating 
potential contained a point mutation within S2 plus a deletion in the hypervari- 
able region of SI (Fleming et al 1986, 1987). This finding led to the conclusion 
that the role of S in demyelination may involve multiple determinants on this 
molecule (Wang et al., 1982]. Evidently, care must be taken in defining attenua- 
tion and virulence as it pertains to diseases produced by these variants. 


C. Influence of the Genetic Constitution of the Host 
There is abundant documentation that the resistance or susceptibility to 


MCYV-induced disease process is controlled by the genetic determinants of the 
host. With mice, the determinants have been recognized at the cellular level. 


(9861) Tp 12 
tynge (1661) 
[V 13 e1eqns3eyq 
(S861) T" 19 
TyonseL (1661) 
TU 12 eITeqns}e wW 


(T661) Tv 12 SueM 
9961) 7P 12 Burma, J 
(T661) TP 12 SUEM 
(9861) 7P? 12 Surumi 
(C661) T? 12 Sue M 
9961) 7P? 12 Surumi 


(1661) 
‘JD 12 eItUOWN EI 


'l6861) T? 12 SUIOW 


OAIA 
UI o9uos[nirtAognou poouequo ‘OIA 
ut $23K2011s* 10} urstdon poouequgq 
S3€1 SIMI] Sur[U£2A UI 10201 
-QA[OAUI [e1[8 pue [euognou peo1ds 
-9pI^ uA oouo[nitAoInou poouequgd 


penuazod Sunrvur[oKurop pue on 
-rTeudəouə y30q 10} poyenus}}eOININ 


jeuejod orueSo31[eudoouo surejoq 
əseəsrp Surjeutr[oKurop 
93noeqns soonput ‘po}enus}}eOININ 


əseəsrp Suryeur[oAud 
-9p anode soonput 'pojenuo33eoInoN 


C-[9 ut punoj S jo USUI 
-os1e[ua [poruopr JOU YSNoY} 1e[rurs 


uto30Jd s jo sjuoureSre[uo PGY-ST 
çS Ul OVE? 
opriooponu 3e uorjenui 3utod sn[d 
uot821 2I0EUEAIadA [S ut pc9T 0} 

€csTI sopnoo[onu Suruueds uonələG 
TS ut OS6T pure 9971 

sopnoo[onu e suonejnui 3utod OMI, 
yuəur38əs CS ut OPEE 

opnoo[onu e UOTIe NUT 3utod ə[38uts 
əuə3 
AH 201 ur uonə]əp əpinuoə|onu-gç/ 
(aas [S JO UOTSII o[qe eA 

-IadAY ur uono[op opnoosponu-Tppt 


s][99 
[ernəu e1 poni 
-[h2 wor} po1e[os 


sni[eudoouo ə1noe 
(LA SIM27 JO ute1q 


929U?3SIS91 UOT} 
-ezi[er;nou o[qno(T 
qV]A 0} Vue) 
-SISOI UOH?ZI[E3n9 NI 
GVW O3 92ut3 
-SISOI ZUIZIJEIMƏN 
sni[oKui 
-o[eudoouo Ze 
-I[oKuTop YIM 231 
IM JO p109 [eutdg 


ASNO 


C19 


TAT LITT 


TA-TZ 


TATT 


p109 JT LIV 


AWHÍ 





$329U2313J21I 


poA1ssqo ÁSo[ou1edoinaN 


9[nos[ous əy} ut UOT168191[V 


UOT)29[9S JO stseg 


JUEPH?PA 


UIEI)S ShIIA 
[e3uəreq 





paseasiq| SNO pue AWHÍ Jo S ut sasueyD uəəA1əq diysuonepPYy `TI ATAVL 


‘(ssoid ur) satec pue »prseq wo poidepyp 





(C861) 
[V 12 19|qou d 


(8461) T? 12 deet 


(6861) T" 12 

1ex1eq ‘(066T) 

[V 12 19u3e][e> 
(9861) T" 1? przed 


(8861) ID 12 esse 


(6461) Tp 72 qqoy 


uonei[gul [22 Á1o3eururep 
-UI paaeroosse JNOYIIM oseasrp Sut 
-jeur[oAUurop INIL ‘payenua}}BOININ 


uisidon 

[£uo1nou jo sso[ auoa1edde Goen 

-[gu1 [[29 Á103eurure gut 23e1o9pour 

(31A po3eroosse uoneul[oÁAUurop 
(ect pəseəzour !po1enuo131e01n2NI 

so1en 

-[gur Á103eurureput YIM ƏSEƏSID 

Surjeur[oÁurop 'so3A2013s* JO JUJU 

-9A[OAUI 3UPUTUIOD2Id (IA so1nj 
-[n9 TIET 381 jo SUOTIDAFUI 3u21sts19q 

uoneurp[oÁur 

-əp pue srirguruour soonpur ‘SIHTE 
-udəouə 3notnrA uor?23jur JEUOINƏN 


I94T1 0} S8ZI sSepnoo[onu BSUTATOA 
-UI [S ut uonos[op opnoo[onu-//p 


ETZI 03 gert Sepnoo[onu SurA[oA 
-UI [S ur uonə]əp opnoo[»nu-9zp 


9Ane32u VNDI 


yueynui SI SIS 
-9UdSEINUI ]eorurouO 


GVW S 03 ?2ut3 
-SISOI UOTIEZITEIININ 


qVW S 91 22ut1 
-SIS31 UOD?ZI[E3n9 NI 


JUPINUI S} SIS 
-ouogejnul TEL 


JUEINUI S] SIS 
-ouogejnul |eorurəuO 


gei 


€ TIVA 


Tel VSA 


eps} 


€N 


V-AHIN 


FA IW 


280 SAMUEL DALES AND ROBERT ANDERSON 


Notably, SJL/J mice resist infection by JHMV but not by the related MHV3. This 
distinction between virus strains becomes clear when different primary cell 
types are challenged, including neurons, macrophages (Knobler et al., 1981], AS, 
and OL (Wilson and Dales, 1988). In contrast to SJL/J mice, A/J adults and their 
peritoneal exudate macrophages are resistant to MHV3. By challenging IN the 
hybrid mice crosses SJL/J x CD.1 and A/J x DBA and explanted cells from these 
crosses, it can be demonstrated (as shown in Table III) that resistance is mani- 
fested as a recessive trait (reviewed in Wilson and Dales, 1988). This result is 
consistent with the idea that a specific, cellular protease is required to activate 
S by cleavage to S, and S,, and thus promote the spread of infection. However, 
the disease pattern produced with the virus when inoculated into F, mice is 
unlike that expected from fully susceptible mouse strains. For example, IP 
challenge of the F, A/J x DBA cross with MHV3, rather than inducing a fulmi- 
nant visceral and liver disease, instead elicits late-onset neurological symptoms 
with paralysis (Sorensen et al., 1982; Le Prévost et al., 1975). Evidently some 
additional factor(s} is controlling virus expression and influencing the in vivo 
disease process. 

It has been suggested that any variable that influences survival of the 
animal host, following virally induced encephalitis, results in the subsequent 
manifestation of a demyelinating form of disease (Fleming et al., 1989}. Our 
own studies indicate that subacute demyelination may occur after the host has 
undergone a transitory and limiting form of encephalitis (Pasick and Dales, 
1991). 

Genetic variability of the cellular immune response could be one of such 
factor determining the outcome of the disease process. Peritoneal macrophages 


TABLE III. Comparison of Replication in Glial Cultures from Purebred and 








Hybrid Mice?’ 
Mouse strain 
from which Time after Titer (102 PFU/mljc 
culture inoculation 
originated (hr) JHMV* MHV34 A594 
SJL 12 0.17 + 0.29 (3) 590 + 114 (3) 6 + 2 (6) 
24 23 + 12 (14) 435 + 292 (14) 13 + 0.5 (6) 
48 48 + 20 (14) 886 + 62 (10 2 + 0.6 (3) 
72 22 + 9 (12) 475 + 119 (6) ND 
CD.1 12 350 + 216 (5) 77 + 12 (4) ND 
24 3,260 + 2,110 (13) 778 + 30 (11) 21,300 + 19,600 (3) 
48 9,400 + 500 (9) 3,340 + 2,300 (7) 95,800 + 2,700 (3) 
(CD.1 x SJLJF, 24 961 + 34 (5) 1,120 + 730 (5) ND 
48 6,260 + 4,320 (5) 2,400 + 2,000 (5) ND 
72, 943 + 480 (5V 213 + 52 (5V ND 





?From Wilson and Dales (1988). 

bPrimarily oligodendrocytes and astrocytes (35). 

"The values are means with standard deviations. The number of cultures tested is shown in parentheses. 
dMOI, 1 PFU per cell. 

*ND, not determined. 

fDecrease in titers was attributed to rapid cell killing. 
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from a substrain of C,H mice, normally resistant to MCV, became susceptible if 
the host had been pretreated with hydrocortisone, implying that a lymphokine- 
induced resistance was involved (Taylor et al., 1981). Infection of susceptible 
C57BL/6 mice with MHV3 may abrogate or depress cellular immunity directly 
because the virus is lymphotropic, hence immunosuppressive, and causes the 
spleen and thymus to atrophy (Lamontagne et al., 1989). The role of macro- 
phages in the MHV3 disease process can be demonstrated in the adoptive 
transfer of resistance by means of bone marrow cells from resistant into suscep- 
tible X-irradiated mouse recipients (Lamontagne et al., 1989; Dupuy et al., 
1984). 

Among rats, strain WF has a more prolonged age-related susceptibility to 
inoculation with JHMV than WL and other strains (Sorensen et al., 1982). The 
tropism of this virus for immature OL in vitro (Beushausen and Dales, 1985; 
Wilson et al., 1986; Beushausen et al., 1987) and the documented, extended 
duration of susceptibility to JHMV in postnatal, preweanling WF rats may be 
connected. Since the maturation of the CNS in this strain is slower, presumably 
due to genetic deficiency of circulating growth hormone (reviewed in Sorensen 
et al., 1980], a greater number of infectable OL progenitors might be present for a 
longer duration in WF versus WL rats. As is the case with the mouse, rats also 
manifest genetic variation of immune responses, which are superimposable on 
virus-target cell interactions. An autoimmune type of disease, similar to exper- 
imental allergic encephalomyelitis (EAE), was recognized following JHMV in- 
fection of WL rats (Watanabe et al., 1983; Wege et al., 1984). This strain is readily 
inducible for EAE by inoculation with myelin components. However, despite 
similarities in appearances of demyelinating lesions evident in WL and WF rats, 
EAE cannot be induced with myelin antigens in WF rats (Zimmer and Dales, 
1989). These combined data indicate that JHMV infection of WL rats may elicit 
a primary demyelination due to OL infection, as well as an EAE-like disease, 
provoked by circulating myelin antigens released by cytolysis of infected OL. 
This duality of disease processes may occur in WL rats, perhaps because they 
are endowed with a high frequency of T-cell responders to myelin basic protein. 

In rats, naturally or deliberately made deficient in cellular immunity, 
susceptibility to JHMV-related CNS disease becomes extended indefinitely 
beyond weaning. Athymic nu/nu rats undergo a late-onset acute encephalitis 
with only a limited white matter involvement, evident by the presence of small 
lesions (Sorensen et al., 1987b). The disease process is primarily connected with 
fulminant gray matter neuronal infections, which cannot be kept in check due 
to a genetic deficiency of cellular immunity. Humoral immune responses, 
although they do occur locally within the CNS of JHMV-infected WL and WF 
rats, appear not to influence the demyelinative disease process (Sorensen et al., 
1984b; Dórries et al., 1986). On the other hand, a vigorous humoral immunity 
detected in the CNS of BN rats could explain why the disease is suppressed in 
this strain (Dórries et al., 1987). 

From the above considerations, it is evident that genetic variability of the 
virus and the host can result in differing diseases in animals and varying 
cytopathic responses when infections are carried out in vitro. Within the ani- 
mal there are additional important factors, most notably the immune system, 
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that also determine the nature of pathogenesis and type of disease that de- 
velops. 


V. SUMMARY 


Based on the information reviewed in this chapter, it becomes clear that 
both the virus and host are involved in determining the outcome of CNS 
infections by MCV. The current information is consistent with the idea that 
virus tropism for specific cell types within the CNS is profoundly influenced 
by: (1) the state of maturation of the CNS, i.e, by cell lineages and parameters 
controlling differentiation; (2) immune responses; and (3) possibly, evolution of 
genetic variants of the virus. Thus, pathogenesis and neurological disease in 
this complex rodent model of virus infection become a manifestation of an 
interplay between the above three, equally important, parameters. 
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CHAPTER 14 


Feline Infectious Peritonitis 


RAOUL J. DE GROOT AND MARIAN C. HORZINEK 


I. INTRODUCTION 


Feline infectious peritonitis (FIP) is one of the most intriguing diseases caused 
by a coronavirus. It involves immune-mediated phenomena such as antibody- 
dependent enhancement of virus infection- and immune complex-induced pa- 
thology. Furthermore, there is increasing evidence for the existence of a carrier 
state. FIP and feline coronaviruses have been extensively reviewed (Pedersen, 
1976b, 1983a,b, 1987a; Barlough and Stoddart, 1986, 1990; Olsen, 1993). Most of 
these reviews emphasized the pathology, epidemiology, and classical virology. 
During the last 10 years, our knowledge of the molecular biology of feline 
coronaviruses has increased considerably. We will summarize clinicopathologi- 
cal findings and the history of FIP research only briefly. The focus will be on the 
molecular aspects of feline coronaviruses and of FIP pathogenesis. 


Il. CLINICAL SIGNS AND PATHOLOGY 


FIP is a progressive, debilitating lethal disease of domestic and wild Felidae. 
The disease is characterized by disseminated perivascular pyogranulomatous 
inflammation and exudative fibrinous serositis in the abdominal and thoracic 
cavities. The initial signs of naturally occurring FIP are not very characteristic. 
The affected cats show anorexia, chronic fever, and malaise. Occasionally, 
ocular and neurological disorders occur. In classical “wet” or effusive FIP these 
signs are accompanied by a gradual abdominal distension due to the accumula- 
tion of a viscous yellow ascitic fluid. The quantity of fluid can vary from a few 
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milliliters to well over a liter. There is also the "dry" or noneffusive form of FIP 
where little or no exudate is present. The wet and dry forms of FIP are different 
manifestations of the same infection (Montali and Strandberg, 1972). Gross FIP 
lesions appear as multiple grayish-white nodules («1 to 10 mm) in the serosal 
membranes, liver, lungs, spleen, omentum, intestines, and kidneys (Wolfe and 
Griesemer, 1971; Montali and Strandberg, 1972). Microscopic lesions consist of 
disseminated foci of necrosis and pyogranulomatous inflammation, frequently 
located around smaller vessels. These lesions are characterized by accumula- 
tions of fibrin and necrotic debris and by perivascular infiltrations of macro- 
phages, neutrophils, and lymphocytes (Wolfe and Griesemer, 1971; Montali and 
Strandberg, 1972; Hayashi et al., 1977; Weiss et al 1980). 


II. DISCOVERY AND EARLY STUDIES 


The early 1960s are usually quoted as the period when FIP was first recog- 
nized (Holzworth, 1963; Feldmann and Jortner, 1964). However, the disease was 
probably seen earlier. In 1912/13, a case of conspicuous abdominal distension 
due to ascites formation in a domestic cat was reported (Jakob, 1914]; the 
retrospective diagnosis is supported by the description of fever, high specific 
gravity of the abdominal fluid containing many granulocytes, and ophthalmo- 
logical signs in this animal (Fig. 1). 

The infectious nature of FIP was established by Wolfe and Griesemer 
(1966). These authors showed that FIP could be produced in specific pathogen- 
free cats by intraperitoneal inoculation of ascitic fluid collected from diseased 
cats. Zook et al. (1968) and Ward et al. (1968) were the first to present evidence 
supporting a viral etiology. Filtration studies indicated that the causative agent 
could pass 200-nm pores. In areas of inflammation, viral particles were observed 
by electron microscopy within or budding into the endoplasmic reticulum of 
macrophagelike cells (Zook et al., 1968; Ward et al., 1968; Ward, 1970; Pedersen, 
1976a}. The viral etiology was formally proven by transmission experiments 
using virus grown in autochthonous peritoneal macrophage cultures (Pedersen, 
1976a) and virus suspensions purified by density gradient techniques (Horzinek 
et al., 1977). 

The morphology and morphogenesis of FIP virus (FIPV) was typical of a 
coronavirus (Zook et al., 1968; Ward, 1970). Serological studies revealed an 
antigenic relationship between FIPV and established members of the Corona- 
viridae family, such as porcine transmissible gastroenteritis virus (TGEV|, ca- 
nine coronavirus (CCV}, and the human coronavirus (HCV) 229E (Reynolds et 
al., 1977; Witte et al., 1977; Pedersen et al., 1978; Horzinek et al., 1982). In fact, 
CCV, TGEV, and FIPV are antigenically so similar that they may be regarded as 
host range mutants rather than as separate species (Horzinek et al., 1982). 
Consistent with this view, inoculation of FIPV causes TGE-like lesions in the 
small intestine of piglets (Woods et al., 1981). Furthermore, FIP-like pyogran- 
ulomatous lesions were found in cats experimentally infected with the CCV 
strain Insvac-1 (McArdle et al., 1992). Another coronavirus of pigs, the porcine 
epidemic diarrhea virus (PEDV), was found to be antigenically related to FIPV; 
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FIGURE 1. Probably the earliest case of FIP 
documented in the literature (at the State 
Veterinary School in Utrecht, 1912/13]; the 
retrospective diagnosis is plausible from the 
description of a chronic exudative peritoni- 
tis, dyspnea (pleuritis?], fever, and eye symp- 
toms (Jakob, 1914). 





serological cross-reactions were confined to the nucleocapsid protein (Zhou et 
al., 1988). 

The isolation of feline coronaviruses (FCoVs) from naturally infected cats 
is notoriously difficult, probably due to their poor in vitro growth; FCoV strains 
isolated so far are listed in Table I. Since initial attempts to grow the virus in 
embryonated eggs, primary cells, and continuous cell lines were not successful, 
FCoV strains were adapted to and propagated in the brains of suckling mice, 
rats, and hamsters (Osterhaus et al., 1978a,b). In vitro propagation was first 
reported in macrophage cultures (Pedersen, 1976a). A breakthrough was the 
finding that FCoVs could be grown in continuous lines and primary cultures of 
feline fetal cells (O'Reilly et al., 1979; Black, 1980; Evermann et al., 1981; 
Pedersen et al., 1981a). Currently, FCoVs are mostly grown in Crandell feline 
kidney (CrFK) cells, feline embryonic lung (FEL) cells (O'Reilly et al. 1979], and 
the line of Felis catus whole fetus (fcwf) cells developed by Niels Pedersen; the 
latter exhibit macrophagelike features at early passage levels (Jacobse Geels and 
Horzinek, 1983). 


Strain 


FIPV UCD) (NW1) 
FIPV UCD2 

FIPV UCD3 

FIPV UCD4 

FIPV UCD5 

FIPV UCD6 

FECV UCD 

FIPV TN-406 

FIPV Yayoi 

FIPV Dahlberg 
FIPV KU-2 

FIPV 79-1146 

FIPV NOR15 (DF2) 
FECV 79-1683 
FIPV Cornell-1 
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TABLE I. Feline Coronavirus Isolates 


Presumptive serotype 


n~ jq ` kel ` kel ON ` zf pq ce kel ` Fe 


Reference 


Pedersen et al. (1981a| 

Pedersen and Floyd (1985) 

Pedersen and Floyd (1985) 

Pedersen and Floyd (1985) 

Pedersen [personal communication| 
Pedersen [personal communication) 
Pedersen et al. (1981b] 

Black (1980) 

Hayashi et al. (1981) 

Osterhaus et al. (1978a) 

Hohdatsu et al. (1991b) 

McKeirnan et al. (1981) 

Everman et al. (1981) 

McKeirnan et al. (1981) 

Scott (1987) 


FIPV KU-1 II 
FIPV Wellcome 2 


Hohdatsu et al. (1991b) 
O'Reilly et al. (1979) 


IV. EPIZOOTIOLOGY AND EPIDEMIOLOGY 


Although FIP is a disease mainly of domestic cats, it has been reported in 
several wild felids, such as the lion (Colby and Low, 1970], leopard (Tuch et al., 
1974), European wildcats (Watt et al., 1993), caracal, and lynx (Poelma et al., 
1971). The cheetah (Acinonyx jubatus) is extremely sensitive FIP, a peculiarity 
attributed to the genetic uniformity in this species (O’Brien et al., 1985). 

FIP occurs in domestic cats of both sexes at about the same frequency 
(Pedersen, 1976c). The disease occurs most commonly in young cats between 6 
months and 2 years of age. In cats between 5 and 13 years of age, FIP is less 
prevalent; but there appears to be an increased incidence in cats older than 14 
years (Pedersen, 1976c, 1983a,b; Addie and Jarrett, 1992a). Close contact be- 
tween cats is required for effective transmission. Virus is shed from the oro- 
pharynx and in the feces (Stoddart et al., 1988a,b). Therefore, the infection most 
likely results from ingestion and/or inhalation of the virus. 

Epidemiological studies showed that FCoVs have a worldwide distribution 
(Horzinek and Osterhaus, 1979) and are widespread in the cat population. 
Antibodies directed against FIPV or a closely related coronavirus are found in 80 
to 9096 of the cats in catteries and in 10 to 50% of the cats in single cat 
households (Pedersen, 1976c; Loeffler et al., 1978; Sparkes et al., 1991, 1992; 
Addie and Jarrett, 1992a,b). The disease itself, however, occurs sporadically: 
only 5 to 1096 of the seropositive cats actually develop FIP (Pedersen, 1976b; 
Addie and Jarrett, 1992a,b). These findings seem at odds with the fact that 
certain FIPV strains such as 79-1146 cause FIP in almost 100% of the experimen- 
tally infected cats. To explain this discrepancy, it was initially proposed that 
most cats in the field become infected with an avirulent virus closely related to 
FIPV. The isolation of two strains, UCD (Pedersen et al., 1981b) and 79-1683 
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(McKeirnan et al., 1981), that only cause a mild enteric infection but no FIP 
seemed to support this view. These avirulent viruses were designated feline 
enteric coronaviruses (FECVs] (Pedersen et al., 1981b, 1984; Pedersen, 19834). 
There is now, however, ample evidence that FECV and FIPV are merely viru- 
lence variants of the same virus. The FECV isolates are antigenically (Pedersen 
et al., 1983; Boyle et al., 1984; Fiscus and Teramoto, 1987a; Hohdatsu et al., 
1991a,b; W. V. Corapi, personal communication) and genetically (Herrewegh, 
Horzinek, Rottier, and de Groot, in preparation; see Section VI) indistinguish- 
able from FIPV strains. The antigenic differences between FIPV 79-1146 and 
FECV 79-1683 (Fiscus et al., 1987; Hohdatsu et al., 1991b) simply reflect FCoV 
strain variations and do not correlate with their respective pathogenic proper- 
ties. FECV 79-1683 is antigenically (Pedersen et al., 1984; Hohdatsu et al., 
1991a,b; W. V. Corapi, personal communication) and genetically (Herrewegh, 
Vennema, Horzinek, Rottier, and de Groot, in preparation; see Section VI) more 
similar to FIPV 79-1146 than to FECV strain UCD. Moreover, attenuated FIPV 
strains with properties similar to FECV are readily obtained (Pedersen and 
Black, 1983; Barlough and Stoddart, 1990), and it is now recognized that even 
FIP-inducing strains vary dramatically in their pathogenicity (Pedersen and 
Floyd, 1985; Pedersen, 1987b}. In the field, highly virulent strains may be as 
uncommon as completely avirulent strains. It is therefore more appropriate to 
consider all strains under the general category of FCoV (Barlough and Stoddart, 
1990; Addie and Jarrett, 1992b). 

The outcome of an FCoV-infection not only depends on the virus strain but 
also on the infective dose and the route of inoculation (Pedersen et al., 1981a; 
Pedersen and Floyd, 1985), the age of the host, concurrent viral infections [e.g., 
with feline leukemia virus (FeLV)] (Hardy and Hurvitz, 1971; Cotter et al., 1975, 
Pedersen and Floyd, 1985; Pedersen, 1987b], "stress" (Pedersen, 1976b|, and, 
possibly, the genetic predisposition of the host (O’Brien et al 1985; Addie and 
Jarrett, 1992a,b). The combined data suggest that FIP is an infrequent manifesta- 
tion of a common, inapparent infection (Loeffler et al 1978; Horzinek and 
Osterhaus, 1979; Pedersen, 1976b; Addie and Jarrett, 1992a,b). From an evolu- 
tionary point of view, viruses that cause low morbidity and low mortality have 
a survival advantage, especially if they can establish persistent infections. 
There is increasing evidence for the occurrence of asymptomatic carriers of 
FCoV. Kittens exposed to healthy, FCoV-seropositive cats seroconvert within 2 
to 10 weeks (Pedersen et al., 1981b; Addie and Jarrett, 1992a), a number of these 
kittens subsequently develop FIP. Strong support for the existence of a FCoV 
carrier state stems from a series of experiments in which FIPV-immune cats 
were immunosuppressed by superinfection with FeLV. Thus, FIP could be in- 
duced in cats that were kept in strict isolation up to 4 months after exposure to 
FIPV (Pedersen and Floyd, 1985; Pedersen, 1987b). The reverse transcription- 
polymerase chain reaction (RT-PCR) detection of FCoV in the feces and plasma 
of healthy, seropositive cats (Herrewegh et al., 1995) provides further support 
for a FCoV carrier state. 

The epidemiological findings fit a model in which most FCoVs, though in 
principle pathogenic, behave as innocent commensals. The host’s immune 
system [presumably the cellular arm) (Pedersen, 1987b) keeps the infection in 
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check but is unable to clear the virus completely. Thus, upon natural infection 
disease would only ensue under exceptional circumstances, i.e., when (1) the 
host is exposed to a high virus dose, (2) the host is unable to develop a protective 
immune response, or (3) an immunocompetent host is exposed to factors that 
compromise its immune system. 


V. MOLECULAR BIOLOGY OF FCoV 


As for most other coronaviruses, FCoV virions are composed of three main 
structural protein species: a 45-kDa nucleocapsid protein N, a 25- to 30-kDa 
matrix protein M, and a 180- to 210-kDa peplomer protein S (Horzinek et al., 
1982; Boyle et al., 1984; de Groot et al., 1987a). Virions of TGEV also contain a 
small membrane protein (sM) of 10 kDa (Godet et al., 1992). Recent data ob- 
tained in our laboratory indicate that sM is present in FCoV virions as well 
(Vennema, Godeke, Horzinek, and Rottier, in preparation). Further details on 
the characteristics of the structural proteins and coronavirus assembly will be 
presented elsewhere in this volume (Chapters 5-9). 

The FCoV genome is a positive-stranded RNA molecule with an estimated 
length of about 30 kb. Thus far, the FCoV strain best characterized at the 
molecular level is FIPV 79-1146. Initially, six virus-specific, poly(A]-containing 
RNAs were found in infected cells, with lengths ranging from 1.4 to more than 
20 kb. An identical set of RNAs was found in cells infected with FIPV strain 
NORI5 (de Groot et al., 1987a). Recently, we have obtained evidence for a 
seventh virus-specific RNA species (see below}. 

By in vitro translation, subgenomic RNAs of 3.8 and 2.8 kb were shown to 
encode the M and N protein, respectively (de Groot et al., 1987a). Sequence 
analysis of cDNA clones (de Groot et al., 1987b,c) identified the 9.6-kb RNA 
species to encode the S protein. Of the FIPV 79-1146 genome, the nucleotide 
sequence downstream of the polymerase gene has now been completed (de 
Groot et al., 1987b, 1988; Vennema, de Groot, and Spaan, unpublished results). 
The gene organization is identical to that of CCV strain Insavc-1 (Horsburgh et 
al., 1992). In addition to the genes encoding N, M, S, and sM, there are five open 
reading frames (ORFs) (Fig. 2). The sequence 5' AACUAAAC 3' is interspersed 
between the ORFs; the complementary sequence, 3’ GUUUAGUU 3’, is 
thought to function as a promoter for subgenomic RNA synthesis (see Chap- 
ter 2). 

Interestingly, the sM gene is preceded by the partial promoter sequence 3’ 
GUUUAG 5’ (Vennema, de Groot, and Spaan, unpublished results), suggesting 
that it may be translated from a separate, hitherto undetected RNA species (de 
Groot et al., 1987a, 1988). Upon separation of FIPV RNAs in 1.5% agarose gels, a 
minor 4.0-kb RNA species indeed was found. The identity of this RNA was 
confirmed by RT-PCR using oligonucleotide primers located in ORF4 and the 
FIPV leader RNA, followed by sequence analysis (Mijnes and de Groot, unpub- 
lished results). The mRNA for sM is made in very low amounts, and in 1% 
agarose gels it is masked by the far more predominant 3.8-kb RNA species. In 
view of these data it seems appropriate to change the nomenclature of the FIPV 
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FIGURE 2. Organization and expression of the genome of FIPV strain 79-1146. Open reading frames 
are indicated by boxes. The structure of the seven FIPV RNAs is also shown. 


RNAs as indicated in Fig. 2. The ORFs at the 3’ end of the FIPV genome should 
thus be referred to as 7a and 7b (formerly 6a and 6b), according to nomenclature 
recommended by the Coronavirus Study Group (Cavanagh et al., 1990). 

As illustrated in Fig. 2, RNAs 3 and 7 are potentially polycistronic. ORFs 
3a-c and ORFs 7a and 7b are thought to encode nonstructural proteins. It is still 
unclear, however, whether all of these ORFs, in particular ORFs 3b and c, are 
actually expressed (Horsburgh et al., 1992). The ORF7a homologue of TGEV 
contains a 69 nucleotide (nt) deletion compared to ORF7a of FCoV and CCV (de 
Groot et al., 1988; Vennema et al., 1992b; Horsburgh et al., 1992). Its product, a 
9.1-kDa hydrophobic protein, has been detected in TGEV-infected cells (Garwes 
et al., 1989; Tung et al., 1992). It therefore is likely that ORF7a is expressed in 
FCoV-infected cells as well. 

ORF7b is present only in the genomes of FCoV and CCV (Kapke and Brian, 
1986; de Groot et al., 1988; Vennema et al., 1992b; see Chapter 3). It codes for a 
nonvirion secretory glycoprotein, gp7b (Vennema et al., 1992a). The glycopro- 
tein gp7b is not required for virus replication in vivo and in vitro: FECV strain 
79-1683 contains a large deletion in ORF7b (Vennema et al., 1992b), but repli- 
cates to high titers in tissue culture cells and causes enteritis upon experimen- 
tal infection of cats. Also, ORF7b is readily lost upon in vitro passage of other 
FCoV strains: large deletions in this gene were found in high passage stocks of 
strains TN406 and UCD4, in UCD2 and in the DF2-derived Primucell vaccine 
strain (Herrewegh, Vennema, Horzinek, Rottier, and de Groot, in preparation). 
Still, with the exception of strain 79-1683, an intact ORF7b is present, at least at 
low passage numbers, in all nine FCoV laboratory strains studied thus far, 
including the nonvirulent FECV strain UCD. Moreover, no deletions were 
found in ORF7b of 16 field strains as determined by RT-PCR analysis on fecal 
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samples and tissues (Herrewegh et al., 1995). It therefore appears that although 
the 7b gene is dispensible for replication in tissue culture cells and enterocytes, 
its presence provides a selective advantage during natural infection. 

The function of gp7b remains enigmatic. In the infected cell, the protein is 
present only in the lumen of the endoplasmic reticulum (ER) and the Golgi 
(Vennema et al 1992a). An involvement in cytoplasmic processes such as RNA 
replication can therefore be excluded. Presumably, gp7b exerts its function 
extracellularly, perhaps as a virus-encoded modifier of the immune response 
and/or inflammatory reaction (Gooding, 1992). The loss of gp7b may well 
contribute to the reduced virulence of FECV 79-1683, high passage TN406 
(Pedersen and Black, 1983), FIPV UCD2, and the Primucell vaccine strain. 


VI. SEROLOGICAL AND GENETIC RELATIONSHIPS OF FCoV 
ISOLATES 


As first noted by Pedersen et al. (1983), FCoVs can be allocated to at least 
two serotypes on the basis of in vitro neutralization. Sera from cats experimen- 
tally infected with so-called type I FCoVs (FIPV TN-406 or FECV UCD) neu- 
tralized other type I FCoVs but not type II FCoVs (79-1146 and 79-1683} and vice 
versa. The type II strains appeared to be more closely related to CCV and TGEV, 
i.e., immunodominant neutralization epitopes shared by the spike proteins of 
TGEV, CCV, and the type II FCoV strains seemed to be absent in the type I 
strains (Pedersen et al., 1983). These findings were confirmed and extended in 
studies using monoclonal antibodies (MAbs) (Hohdatsu et al., 1991a,b, 1992; 
W. V. Corapi, personal communication). Strains DF-2 (NORI5}, CUI and KU-1 
were tentatively identified as type II viruses, whereas UCD-1, -2, -3, and -4, and 
Yayoi and KU-2 were assigned to serotype I (Table I) (Hohdatsu et al., 1991a,b; 
W. V. Corapi, personal communication). Strain UCD-2 is interesting in that a 
number of epitopes on the spike protein, shared by other group I strains, are 
absent (Fiscus and Teramoto, 1987a,b; W. V. Corapi, personal communication). 
This led Fiscus and Teramoto to suggest that UCD-2 represents a separate 
antigenic type (Fiscus and Teramoto, 19874]. 

There is some confusion in regard to the classification of FCoV strain 
UCDI. According to some reports, UCDI belongs to type I (Pedersen et al., 1983; 
Hohdatsu et al., 1991a,b). Other studies suggest a closer antigenic relationship 
to type II strains (Corapi et al., 1992). As it now appears, virus stocks may have 
been interchanged. The alleged UCDI strain used by Scotts' group (Corapi et al., 
1992; Olsen et al., 1992), is antigenically IW V. Corapi, personal communica- 
tion) and genetically more similar to 79-1146 (type II) than to the original type I 
UCDI (NWI| strain from Niels Pedersen. 

Although the data are limited and rather sketchy, type I FCoV strains seem 
to predominate in the field. Of 20 sera from natural FIP cases in the United 
States, only one contained neutralizing antibodies directed against FIPV strain 
79-1146 (type II). The remaining 19 antisera neutralized TN-406 (type I) with 
titers ranging from 40 to 3200 (Pedersen et al., 1983). Hohdatsu et al. (1992) 
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screened 237 FCoV-positive sera, as determined by immunofluorescent assay 
(IFA), by using a neutralization assay with FIPV strain 79-1146 as the test virus 
and by a competition enzyme-linked immunosorbent assay (ELISA) using a 
type II-specific MAb. Their results suggest that type II viruses account for only 
20-3096 of the natural FCoV infections in Japan. 

The type I and II viruses also appear to differ in their in vitro growth 
characteristics. The type I strains grow poorly in tissue culture, yielding titers 
of cell-free virus of only 0.5 to 1 x 10° TCID50/ml. More than 90% of the 
infectivity remains cell-associated (Pedersen et al., 1983; Pedersen and Floyd, 
1985; Hohdatsu et al 1991b). In contrast, type II FCoVs grow to high titers of 
cell-free virus (up to 5 x 107 TCID50/ml) and also in these properties closely 
resemble CCV (Pedersen et al., 1983). 

One interpretation of the data is that the FCoV are a heterogeneous group, 
with the type II viruses being genetically more closely related to CCV than to 
type I strains (Pedersen et al., 1983). The fact that some CCV strains induce FIP 
upon experimental inoculation of cats (McArdle et al., 1992) would lend further 
credence to this idea. To study this issue, we compared the 3’-most transcrip- 
tion unit comprising ORFs 7a, 7b, and the 3’ nontranslated sequences (NTR) of 
nine different FCoV isolates. Nucleotide sequence similarities ranged from 87 
to 99% (Herrewegh, Vennema, Horzinek, Rottier, and de Groot, in preparation). 
As shown in Fig. 3, types I and II cannot be distinguished in this part of the 
genome. Clearly, they are much more related to each other than to the CCV 
strains sequenced thus far. 

From serological studies, it appears that the differences between type I and 
II FCoVs are mostly located in the S-protein (Fiscus and Teramoto, 1987a,b; 
Hohdatsu et al., 1991a,b; W. V. Corapi, personal communication). Very recently, 
these findings were confirmed by sequence analysis. The spike genes of type II 
FCoVs bear much greater resemblance to those of TGEV and CCV than to the 
spike genes of type I isolates KU-2 (Motokawa et al 1995) and UCD3 (Vennema 
et al., 1995). How can these observations be reconciled with those made for the 
ORF7a/7b transcription unit? During coronavirus replication, homologous 
RNA recombination occurs at a high frequency. It is quite conceivable that the 
type II FCoV strains have arisen by RNA recombination between a type I FCoV 
and CCV (Vennema et al., 1995; Herrewegh et al., in preparation]. 

Figure 3 also shows that the avirulent “FECV” strains UCD and 79-1683 are 
more similar to pathogenic "FIPV" strains than to each other. These findings 
again indicate that the virulent and avirulent FCoVs are not separate virus 
species. 


VII. ANTIBODY-DEPENDENT ENHANCEMENT 


The key pathogenic event in FIP is the infection of monocytes and macro- 
phages (Ward, 1970; Pedersen, 19762]. Avirulent FCoV strains remain confined 
to the digestive tract and usually do not spread beyond the intestinal epithelium 
and regional lymph nodes (Pedersen et al., 1981b, 1984]. The virulent strains, 
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however, disseminate to other organs most likely via blood-borne monocytes 
(Weiss and Scott, 1981a). The virulence of FCoV strains appears to correlate with 
their ability to infect feline peritoneal macrophages in vitro. When virulent and 
avirulent strains were compared, the latter infected fewer macrophages and 
produced lower virus titers than the virulent strains. Moreover, the avirulent 
strains were less able to sustain viral replication and to spread to other macro- 
phages (Stoddart and Scott, 1989). 

There is ample evidence for an involvement of the immune system in the 
pathogenesis of FIP. Several authors have emphasized the similarities between 
FIP and immune-mediated diseases such as Aleutian disease of mink and Den- 
gue hemorrhagic fever (Horzinek and Osterhaus, 1979; Pedersen and Boyle, 
1980; Weiss and Scott, 1981b; Halstead, 1988). Humoral immunity is obviously 
not protective. FCoV-seropositive cats that are experimentally infected with 
FIPV often develop an accelerated, fulminating course of the disease. Clinical 
signs and lesions develop earlier, and the mean survival time is dramatically 
reduced compared to seronegative cats (early death syndrome) (Pedersen and 
Boyle, 1980; Weiss et al., 1980; Weiss and Scott, 198la—c}. Direct evidence for the 
involvement of antibodies in early death syndrome was obtained by passive 
immunization of cats with the purified immunoglobulin IgG fraction of feline 
aFCoV-antisera. These cats also developed enhanced FIP upon an experimental 
challenge infection (Pedersen and Boyle, 1980]. 

Heterologous expression of S, M, and N provided a means to assess the role 
of each individual protein in FIP pathogenesis (de Groot et al., 1989; Vennema et 
al., 1990b, 1991). S was unequivocally identified as the main, if not the sole, 
factor in the induction of early death (Vennema et al 1990b, 1991). Kittens that 
had been infected with a recombinant vaccinia virus (recVV) expressing S died 
within 9 days after an FIPV challenge infection. In contrast, kittens that had 
been infected with wild-type vaccinia virus or with recVVs expressing M or N 
either survived FIPV infection or succumbed after a protracted course of the 
disease (21-41 days) (Vennema et al., 1990b, 1991). 

The early death syndrome can be explained by an antibody-dependent 
enhancement of virus infection (ADE), a phenomenon described for a wide 
variety of viruses, including the human immunodeficiency virus (Takeda et al., 
1988; for reviews, see Porterfield, 1986; Halstead, 1988). Binding of antibody to 
virus produces infectious immune complexes that attach to the surface of the 
target cell (via Fc or complement receptors) with higher efficiency than virus 
alone (Chanas et al., 1982; Burstin et al., 1983; Gollins and Porterfield, 1984, 
1985; Halstead and O’Rourke, 1977; Peiris and Porterfield, 1979; Peiris et al., 
1981]. 

Cheryl Stoddart (1989) provided the first in vitro evidence for an involve- 
ment of ADE in FIP by using feline peritoneal macrophage cultures (Stoddart 
and Scott, 1988). The number of macrophages infected with FIPV UCD-1" in 
vitro increased up to 12-fold when the virus had been preincubated with serum 
from an FIPV-sensitized cat. This enhancement also occurred when using the 
high-pressure liquid chromatography (HPLC)-purified IgG-fraction of the se- 
rum. ADE could be prevented by blocking the macrophage Fc receptor with 
aggregated IgG or by blocking the Fc portion of the enhancing IgG with protein 
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A (Stoddart, 1989). The levels of enhancement varied greatly between experi- 
ments with different batches of macrophages; similar difficulties have been 
experienced in ADE studies with peripheral blood leukocyte preparations (Por- 
terfield, 1986). 

Recently, these observations were confirmed and extended by Hohdatsu et 
al. (1991c) and by Fred Scott's group (Olsen et al., 1992; Corapi et al., 1992). In 
these studies, ADE in primary feline macrophages was demonstrated both with 
feline antisera (Olsen et al., 1992; Olsen and Scott, 1993) and with murine 
monoclonal antibodies (Hohdatsu et al 1991c; Olsen et al., 1992; Corapi et al., 
1992). The variation between experiments was minimized by using batches of 
macrophages obtained from a single cat (Olsen et al., 1992). Upon screening of 
67 MAbs specific for S, M, and N, 17 MAbs were identified that enhanced FIPV 
infection (Olsen et al., 1992, 1993; Corapi et al., 1992). Consistent with the data 
of Vennema et al. (1990b, 1991), the ADE-inducing MAbs were all directed 
against the S protein (Olsen et al., 1992; Corapi et al., 1992). There was a distinct 
correlation between the ability of the MAbs to neutralize infection of CrFK 
cells and the induction of ADE: 15 of 19 neutralizing antibodies enhanced FIPV 
infection. All but one of the enhancing MAbs were of the immunoglobulin G2a 
subclass; the four neutralizing MAbs that did not induce ADE were of the G1 
subclass. From these findings Corapi et al. (1992) suggested a restriction in the 
immunoglobulin subclasses mediating ADE. The findings of Hohdatsu et al. 
(1991c) seem to conflict with those of all other authors in that nonneutralizing 
MAbs directed against both M and S induced ADE. However, the enhancement 
levels obtained with these MAbs were very low (two- to threefold}, while in the 
studies of Olsen et al. (1992) and Corapi et al. (1992) some MAbs produced up to 
a 100-fold increase in the number of infected cells. The most convincing 
enhancement reported by Hohdatsu et al. (1991c) (a sixfold increase in the 
number of infected cells) notably occurred with a neutralizing S-specific MAb 
of the immunoglobulin G2a subclass. 

There are at least five distinct neutralization sites on the S protein of FIPV, 
four of which have been implicated in ADE (Corapi et al., 1992). By kinetics- 
based competitive ELISA, two of the enhancement sites were found to correlate 
with the previously defined sites A and E/F of the TGEV S protein (Olsen et al., 
1992). It was also shown that aTGEV MAbs directed against these sites en- 
hanced FIPV infection in vitro (Olsen et al., 1993). By sequence analysis of MAb- 
resistant mutants and PEPSCAN epitope mapping, amino acid residues in- 
volved in the formation of TGEV site A have been localized between amino acid 
residues 538 and 591 (Correa et al., 1990; Gebauer et al., 1991). The epitopes of 
the enhancing FIPV site A MAbs are probably located in the homologous region 
(residues 543-597) (Jacobs et al., 1987) on the FIPV spike (Olsen et al., 1993). 

Olsen et al. (1992) noted strain variations of ADE in that certain MAbs 
would enhance FIPV UCD-1* more than 79-1146 and vice versa. This is most 
likely related to the epitopes recognized by the enhancing MAbs and their 
relative avidity. These observations may in part explain why FIPV does not 
always cause antibody-mediated early death in seropositive cats; thus, early 
death did not occur in kittens that had been inoculated with either FECV UCD, 
FECV 79-1683, or FIPV UCD-2 prior to a lethal challenge with FIPV 79-1146 
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(Pedersen et al., 1984; Pedersen and Floyd, 1985; Fiscus et al 1987). Of the two 
kittens vaccinated with FIPV UCD-4 and challenged with a lethal dose of FIPV 
79-1146, only one developed enhanced FIP (Pedersen and Floyd, 1985; Pedersen, 
1987b). Similarly, of four kittens immunized with FIPV UCD-4, only two died of 
accelerated FIP after a challenge infection with FIPV UCD-1; one kitten died 
after the normal, protracted course of the disease, and the other one survived. 
The combined data indicate that a particular virus-antibody combination is 
required for the development of ADE and early death syndrome. 

The mechanism of ADE in FIPV remains to be elucidated. An important 
question that needs to be addressed is whether the dedicated cell surface recep- 
tor of FIPV is involved. Scott and co-workers have argued that the enhanced 
infection occurs via a very efficient form of FcyR-mediated endocytosis that is 
independent from the FIPV receptor (Corapi et al., 1992; Olsen et al 1992). This 
hypothesis is based upon (1) the "sheer magnitude" of the increase in the 
number of infected cells due to ADE, (2) the observation that the macro- 
phagelike murine IC-21 cells could be infected with FIPV only in the presence of 
an enhancing MAb (Corapi et al., 1992), and (3] the as yet unconfirmed claim 
that ADE can be blocked by lysosomotropic amines (Stoddart, 1989). In this 
model, the nucleocapsid would be released into the cytoplasm by fusion of the 
viral envelope and the endosomal membrane (illustrated in Fig. 4B). 

The above model is difficult to reconcile with other observations made for 
coronavirus entry. During normal entry, membrane fusion is mediated by the 
viral spikes; it occurs readily at neutral pH, but is inhibited at lower pH values. 
Therefore, coronaviruses most likely enter the host cell by fusion at the plasma 
membrane rather than via the endosomal route (see also Chapter 5). Further- 
more, heterologously expressed spike proteins induce membrane fusion in a 
host cell-dependent manner: the spike protein of FIPV only induced cell fusion 
in cells of feline origin. Similarly, fusion by the S protein of mouse hepatitis 
virus was restricted to murine cells (de Groot et al., 1989; Vennema et al., 
1990a). These data indicate that membrane fusion is a highly specific process, 
most likely requiring recognition of the cell surface receptor by the viral spikes 
(Fig. 4A). 

Corapi's experiments with the murine IC-21 cells (Corapi et al., 1992) 
would suggest that during ADE FcyRs suffice for viral entry. However, it cannot 
be excluded that IC-21 cells carry a low affinity receptor for FIPV that is only 
recognized efficiently when virus—antibody complexes are fixed by the FcyRs at 
the plasma membrane. Support for the existence of FIPV receptors on murine 
cells stems from studies in which FIPV strains were adapted to murine Sac- 
cells (de Groot, unpublished results} or propagated in brains of suckling mice 
(Osterhaus et al., 1978a). 

We currently favor a model for ADE in which viral entry is mediated by 
interaction between the spike and the virus receptor, where the probability of 
this event is increased due to a more efficient binding of the opsonized virus to 
the cell surface (explained in Fig. 4C). This mechanism would be similar to that 
found for human immunodeficiency virus (HIV): ADE of HIV infection requires 
both the FcyR and the HIV cell receptor, CD4 (Takeda et al., 1990; Perno et al., 
1990; Zeira et al., 1990; Connor et al., 1991). 
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FIGURE 4. Models for viral entry during (A) normal infection and (B,C) ADE. (B) Model for ADE, in 
which entry occurs via the endosomal route. Adsorption and penetration requires only the Fc 
receptor. (C) Model for ADE, in which viral entry requires both the Fc receptor and the virus 
receptor. 


Clearly, the receptor for FIPV must be identified before this issue can be 
resolved. Two closely related viruses, TGEV and HCV 229E, adsorb to and enter 
their host cell via the aminopeptidase N (APN) molecule, a membrane-bound 
metalloprotease (Delmas et al., 1992; Yeager et al., 1992). The tissue distribu- 
tion of APN coincides with the cell tropism of FIPV; it is present on enterocytes 
and macrophages (Look et al., 1989}. It is thus quite likely that FIPV utilizes 
APN as a receptor aš well. 
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VIII. IMMUNE-MEDIATED LESIONS IN FIP 


Most authors consider the vascular and perivascular lesions in FIP to be 
immune-mediated, but there is uncertainty about the actual pathogenic mech- 
anism. At least some vascular injury may be attributed to immune-mediated 
lysis of infected cells (Weiss and Scott, 1981a): FIPV-infected white blood cells 
were detected in the lumen, intima and wall of veins and in perivascular 
locations (Weiss and Scott, 1981a; Weiss et al., 1988). Furthermore, inflamma- 
tory mediators such as cytokines (Goitsuka et al., 1988, 1990; Hasegawa and 
Hasegawa, 1991), leukotrienes, and prostaglandins (Weiss et al., 1988) that are 
released by infected macrophages could play a pivotal role in the development 
of the perivascular pyogranulomata. As hypothesized by Weiss et al. (1988), 
these products could induce vascular permeability changes and provide addi- 
tional chemotactic stimuli for neutrophils and monocytes. In response to the 
inflammation, the attracted cells may release additional mediators and cyto- 
toxic substances; the monocytes would also serve as new targets for FIPV. The 
end result of this would be enhanced local virus production and increased tissue 
damage. 

Other observations point toward an immune complex pathogenesis. Depo- 
sition of immune complexes and subsequent complement activation is thought 
to cause an intense inflammatory response that may extend across blood vessel 
walls. The resulting vascular damage would permit leakage of fluid into the 
intercellular space and eventually lead to the accumulation of thoracic and 
abdominal exudate (Hayashi et al., 1977; Horzinek and Osterhaus, 1979; 
Jacobse-Geels et al., 1980, 1982; Pedersen and Boyle, 1980; Weiss et al., 1980; 
Weiss and Scott, 1981a-c]. The morphological features of the vascular lesions 
(necrosis, polymorphonuclear cell infiltration associated with small veins and 
venules} strongly indicate an Arthus-type reaction (Hayashi et al 1977; Weiss 
and Scott, 1981c). The lesions contain focal deposits of virus, IgG, and C3 
(Pedersen and Boyle, 1980; Weiss and Scott, 1981b). Moreover, complement 
depletion and circulating immune complexes were demonstrated in cats with 
terminal FIP (Pedersen and Boyle, 1980). In a horizontal study of six experimen- 
tally infected cats, first clinical signs were accompanied by increased C3 con- 
centrations in the plasma; subsequently, antibody titers and circulating im- 
mune complexes (ICX) increased with concomitant decrease of complement 
concentrations. At the time of death, maximum ICX and minimum C3 concen- 
trations were measured (Jacobse-Geels et al., 1982). Glomerular ICX deposits 
were found in FIPV-infected cats but not in uninfected controls (Jacobse-Geels 
et al., 1980). 


IX. VACCINE DEVELOPMENT 


The development of vaccines against FIPV is cumbersome and has been 
frustrated by the occurrence of ADE. Vaccination with heterologous live virus 
vaccines (TGEV, CCV, HCV 229E] has failed to provide protection (Witte et al., 
1977; Toma et al., 1979; Woods and Pedersen, 1979; Barlough et al., 1984, 1985). 
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Occasionally, protective immunity can be induced by administering FECV, low 
virulent FIPV (e.g., FIPV UCD-3), or sublethal amounts of virulent FIPV (Pe- 
dersen and Black, 1983; Pedersen et al., 1984; Pedersen and Floyd, 1985; Peder- 
sen, 1987b). However, as quoted from Pedersen and Black (1983, p. 20), the 
results of immunization with homologous live virus vaccines were "too incon- 
sistent and hazardous to have clinical relevance": while some cats became 
solidly immune, others developed disease or became sensitized. Also, cats 
immunized with recombinant vaccinia viruses expressing the spike gene of 
FIPV 79-1146 (Vennema et al., 1990b) or of FECV 79-1683 (Bhogal, Martinez, 
Reed, KLepfer, Jones, Pfeiffer, and Miller, in preparation) developed early death 
syndrome upon an FIPV challenge infection. 

A temperature-sensitive strain of FIPV has been shown to stimulate protec- 
tive immunity in cats. Upon intranasal inoculation into cats, virus replication 
appeared to be restricted to the upper respiratory tract. The mutant did not 
differ from the wild-type virus with respect to RNA and protein synthesis, but 
produced little progeny at the nonpermissive temperature (Christianson et al., 
1989. Gerber et al., 1990). There is still controversy over the safety and effi- 
caciousness of this vaccine in that protection may depend on the strain and dose 
of the challenge virus (Olsen, 1993). Furthermore, even as avirulent and immu- 
nizing modified live FIPV strains become available, their widespread use as 
vaccines is not encouraged in view of the notorious genetic flexibility of corona- 
viruses. Thus it has been demonstrated that new epidemic strains of avian 
infectious bronchitis virus have arisen by point mutations in the genomes of 
attenuated vaccine strains (Kusters et al., 1987). While this may be acceptable in 
production animals where new vaccine strains can be employed at intervals, 
the high individual value of a companion animal would plead against the use of 
live coronavirus vaccines. The possible risks are best illustrated by outbreaks of 
an unprecedented FIP-like disease in dogs after the widespread use of a modified 
live CCV vaccine (Martin, 1985). 

Designing safe and effective FIP vaccines that avoid immunopathology will 
probably require a better understanding of the complex interplay between the 
virus and the host immune system and, perhaps just as important, identifica- 
tion of the viral proteins involved. The molecular cloning of the FIPV genome 
and the isolation of viral genes provides powerful tools to tackle these prob- 
lems. Obviously, the S protein is not the most attractive candidate for vaccine 
development (Vennema et al., 1990b; Bhogal, Martinez, Reed, Klepfer, Jones, 
Pfeiffer, and Miller, in preparation}. The role of M in immunity against FIPV is 
still unclear. Upon immunization of eight cats with a vaccinia virus recombi- 
nant expressing M, three kittens survived, while two other kittens showed 
protracted survival times compared to the controls (Vennema et al., 1991). 
Clearly, these findings deserve further scrutiny. 


X. CONCLUDING REMARKS 


Substantial progress has been made in FIP research. Our knowledge of the 
molecular genetics of FCoVs has increased considerably as has our understand- 
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ing of the mechanism of ADE and early death syndrome. Other areas would 
require closer examination, for instance, the role of cellular immunity in pro- 
tection against FIP. As feline cytokine genes are cloned and expressed, new tools 
become available to modulate the feline immune response and possibly also FIP 
pathogenesis. Another interesting field is the presumptive FCoV carrier state. It 
remains to be determined whether FCoVs can cause a true persistent infection, 
and if so, in which cells and tissues of the host the virus resides and how it 
manages to escape immunosurveillance. Finally, FCoVs produce a number of 
proteins that are dispensable for replication in tissue culture cells, most notably 
gp7b. What is the function of these proteins during natural infection? Though 
the last ten years of FIP research have been exciting, the coming years promise 
to be better still. 
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CHAPTER 15 


Epidemiology of Infectious 
Bronchitis Virus 


JANE K. A COOK AND A. P. A. MOCKETT 


I. INTRODUCTION 


Avian infectious bronchitis virus (IBV) is the type species of the genus Corona- 
virus of the family Coronaviridae. It is of considerable economic importance to 
the poultry industry worldwide, causing a highly contagious disease affecting 
the respiratory, reproductive, and renal systems. 


II. HOST RANGE 


The chicken is believed to be the only natural host for IBV; all ages can be 
infected and both sexes appear equally susceptible. Vindevogel et al. (1976) 
found that pigeons were resistant to IBV infection. However, Biondi and 
Schiavo (1966) successfully infected quail but not seven other avian species 
experimentally with IBV, and Cumming (1969) was able to recover the virus 
from the kidneys of one of six magpies inoculated via the conjunctiva. Barr et al. 
(1988) isolated IBV from a flock of racing pigeons with respiratory signs, but 
failed to transmit infection to pigeons experimentally. Based solely on serologi- 
cal data, Weisman et al. (1987) claim to have demonstrated the presence of IBV 
in healthy turkeys, although Cumming (1969) was unable to infect turkeys 
experimentally with IBV. Spackman and Cameron (1983) isolated IBV from 
pheasants with a history of respiratory signs and aberrant egg production, and 
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the disease problem in the pheasants was successfully controlled by the use of 
an oil-based inactivated IB vaccine. This and a further report (Lister et al 1985) 
suggest that the pheasant could be a second natural host for IBV and is the only 
evidence of the ability of IBV to infect avian species other than the chicken. 


III. INCIDENCE 


Avian infectious bronchitis is reported from all countries where an inten- 
sive poultry industry has developed; in most countries it appears as a serious 
disease problem shortly after the development of the industry in that area. The 
relative importance of IBV in a country appears to fluctuate from time to time, 
being dependent on such factors as the percentage of the poultry population 
that is vaccinated and on the emergence of virus types against which current 
vaccines are not wholly effective (see Section VIIA.]. Another factor governing 
the reported incidence of IBV is the efficacy of diagnostic and research facilities 
in different countries. A detailed review of the incidence of IBV worldwide can 
be found in the proceedings of the first and second international symposia on 
infectious bronchitis (Kaleta and Heffels-Redwan, 1988, 1991). 


IV. TRANSMISSION 


The most important route of spread of IBV is from the respiratory tract, 
probably by aerosol. During the clinical phase of the disease and for approx- 
imately 28 days after infection, virus can be recovered intermittently from the 
respiratory tract (Cook, 1968). However, IBV replicates in many tissues through- 
out the chicken (Hofstad and Yoder, 1966; Doherty, 1967), and once chickens 
have recovered clinically, the feces are possibly the best site from which to 
recover virus (Cook, 1984], the intestinal tract (El Houadfi et al., 1986; Ambali 
and Jones, 1990) and specifically the cecal tonsil (Cook, 1968) being suggested as 
a likely reservoir for IBV. These findings emphasize that the importance of fecal 
shedding in the spread of the virus should not be overlooked; although inhala- 
tion of airborne virus may be the most common method of transmission, 
ingestion of virus via contaminated food, drinking water, or litter plays an 
important role in the epidemiology of IBV. 

While lateral transmission is undoubtedly the most important method of 
IBV spread, it has been shown that virus can be recovered from eggs laid by 
experimentally inoculated hens for up to 6 weeks after inoculation with a high 
virus dose (Cook, 1971) and also from a small number of day-old chicks hatched 
from those hens. However, the progeny showed no signs of IB infection. Earlier 
work (Cook and Garside, 1967) had shown that it was possible to hatch and rear 
IB-free chicks from dams showing a high incidence of infection; the conclusion 
was drawn that egg transmission is of negligible importance in the epizootiol- 
ogy of IBV infections. One interesting finding in that work was that IBV could 
be recovered from semen of infected cockerels for up to 14 days after inocula- 
tion, a finding that could have implications for the spread of IBV. 
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1. In the Chicken 


There is no conclusive evidence that a true carrier state exists with IB, but 
virus clearly persists for a considerable time in experimentally inoculated 
chicks. Cook (1968) recovered virus for 49 days from experimentally inoculated 
chicks housed in strict isolation, and could recover it for over 4 months when 
isolation was less effective. It was suggested in the latter case that continual 
reinfection from shedders had occurred. Alexander et al. (1978) isolated IBV 
from the feces of chickens up to 227 days after inoculation of two different 
strains of IBV. These birds were not housed in isolation and again reinfection 
was a possibility. Therefore, although IBV can be isolated in the presence of high 
concentrations of humoral antibody, it was concluded that a true carrier state 
does not occur. However, the demonstration by Jones and Ambali (1987) of 
reexcretion of an enterotropic IBV strain at point of lay, with virus being 
recovered from both tracheal and cloacal swabs, suggests that a state of latency 
may exist. 


2. In the Environment 


The question of persistence of IBV in the environment is a complex one 
because of conflicting evidence on the survival of IBV under different environ- 
mental conditions. Otsuki et al. (1979) demonstrated variation in the resistance 
of ten IBV strains to chemical and physical treatments. During epidemics the 
virus can certainly be transmitted between farms. For example, the same IBV 
serotype has been isolated from four different farms within a 10-mile radius 
over a period of 13 months (Cook, 1988). Airborne spread is likely to have been 
involved, but movement of personnel and equipment probably aided transmis- 
sion. The climatic and other conditions that govern spread are not fully under- 
stood, and many aspects of IBV transmission still require elucidation. More- 
over, the importance of strain variation in the transmission and persistence of 
IBV has not received the attention which the subject merits. 


V. PATHOGENESIS 
A. Clinical Signs 


IBV can induce respiratory signs, affect the reproductive tract, and cause 
nephritis. Young chickens up to 4 weeks of age appear to be most susceptible to 
respiratory disease. The clinical signs, which appear two to three days after 
infection, include nasal discharge, sneezing, coughing, and tracheal rales. Ca- 
seous or catarrhal plugs may be present in the trachea or bronchi. Swollen 
sinuses may occur in some chickens. IBV respiratory infection predisposes 
chickens to infection with secondary pathogens (see Section VC L although 
chickens will usually recover quickly from an uncomplicated viral infection. 
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IBV infection of susceptible layers can result in a drop in egg production or 
failure to lay at full potential. The extent of the drop varies, depending on such 
factors as time of infection in relation to stage of egg production and the general 
health of the flock. Drops in production of 5 to 20% are not uncommon, 
together with the production of increased numbers of downgraded eggs with 
thin, misshapen, rough, and soft shells. Watery whites, in which there is no 
clear demarcation between the thick and thin albumins, are often produced. 
Although layers usually recover from the disease, their egg production rarely 
returns to preinfection levels. IBV infection of chickens less than 2 weeks of age 
can cause damage to the developing reproductive tract, resulting in “false 
layers” that do not lay normally at sexual maturity (Broadfoot et al., 1956; Jones 
and Jordan, 1970). 

Nephritis has been reported as a result of IBV infection. Early reports were 
from Australia and showed that certain strains, such as the T strain, had a 
predilection for the kidney (Cumming, 1963). However, the incidence of nephri- 
tis associated with IBV has increased and has been reported from other coun- 
tries, including Italy (Zanella et al., 1988), Belgium (Froyman et al., 1985), 
France (Picault et al., 1987), Japan (Otsuki, 1988), and the United States (Kinde 
et al., 1991). The reason for this increased incidence of renal involvement 
remains unclear, and particular IBV serotypes do not necessarily appear to be 
responsible. Swollen, pale kidneys with the tubules and ureters enlarged, due to 
the presence of urates, are often observed. Polydipsia may also be present. In 
some outbreaks in Belgium, mortality rates of up to 20% have been reported 
(Nauwynck and Pensaert, 1988). 


B. Histopathology 


This has been covered in previous reviews, for example, King and Cavanagh 
(1991). 


C. Secondary Infections 


The severity of the infection caused by IBV can be increased both by 
environmental factors and by the presence of other microorganisms that may or 
may not be pathogens in their own right. In such cases, severe air sacculitis, 
often accompanied by peritonitis and pericarditis, can result and mortality may 
occur. In adult chickens combined infection with IBV and Mycoplasma galli- 
septicum has been shown to cause a more severe effect on egg production and 
quality than inoculation with either agent alone (Blake, 1962). 

Other viruses, bacteria, and mycoplasmas have all been shown to exacer- 
bate respiratory disease caused by IBV (Jordan, 1972), although most studies 
have incriminated Escherichia coli and Mycoplasma synoviae. Fabricant and 
Levine (1962) and Springer et al. (1974) found that while synergism, as based on 
severity of air sac lesions, could be demonstrated following inoculation with 
IBV and either E. coli or M. synoviae, the most severe infection resulted from 
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inoculation of the three agents together. However, Smith et al. (1985) were able 
to reproduce a highly lethal infection, closely resembling the natural disease, by 
intranasal inoculation of IBV and invasive serotypes of E. coli. Infection of 
similar severity was caused if the E. coli were given at any time between 7 days 
before and 7 days after the virus. Some insight into the mechanism involved 
was provided by the observation that, when inoculated intranasally alone, the 
E. coli replicated poorly in the upper respiratory tract and did not invade. 
However, when IBV was included in the inoculum, E. coli multiplied to very 
high titer in the respiratory tract with subsequent penetration of the invasive E. 
coli strains but not of commensal coliforms. These authors suggested that the 
E. coli were able to invade as a result of the damage that IBV had caused in the 
respiratory tract, a finding confirmed by Nakamura et al. (1992), who showed 
that histopathological lesions in the respiratory tract were more severe when IB 
and E. coli were inoculated together than when either agent was inoculated 
alone. | 


D. Immunity 


It is well established that the chicken responds to IBV infection by produc- 
ing specific antibodies of the immunoglobulin IgM, IgG, and IgA classes, al- 
though the main serum antibody is IgG. Chickens with high levels of specific 
humoral antibodies can still be susceptible to infection, however, suggesting 
that immune factors other than humoral antibody are involved in protection. 

IgM is the first class of antibody to be detected in the serum following IBV 
infection; peak concentration is reached about 8 days after infection and levels 
then decline (Mockett and Cook, 1986). As the presence of this antibody is only 
transitory, it may be useful in the diagnosis of recent infection (Martins et al., 
1991). Since plasma cells soon switch from producing specific IgM to IgG, the 
latter immunoglobulin, although present at relatively low concentrations 7 
days after infection, reaches highest concentrations by about 10 to 14 days, and 
these levels are maintained for a considerable time. The antiviral IgG response, 
shown by enzyme-linked immunosorbent assay (ELISA), is similar to that 
shown by the hemagglutination inhibition test, although the former is much 
more sensitive. Virus-neutralizing (VN) antibodies are detectable from 9 days 
after infection, but take longer to reach a peak (Mockett and Darbyshire, 1981). 
Similar antibody profiles to both the spike and membrane protein have been 
demonstrated using ELISA (Mockett, 1985). It is generally believed that the 
spike protein is responsible for inducing VN antibodies and protective immu- 
nity (Mockett, 1985; Tomley et al., 1987; Cavanagh et al., 1984, 1988], although 
recent evidence suggests that such immunity may also be induced by the 
nucleocapsid protein (Boots et al 1992). 

High concentrations of specific VN antibodies are thought to be important 
in preventing dissemination of the virus from its primary replication site in the 
respiratory tract (Box, 1988; Otsuki, 1988). Hence, such antibodies may prevent 
viral replication in and damage to the reproductive tract and the kidneys. It is 
also known that high concentrations of maternally derived antibodies, which 
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are transmitted from the hen to the chick via the yolk, can help prevent 
infection, although for only until 1 to 2 weeks of age (Mockett et al 1987). 

IBV-specific local antibodies have been detected in lachrymal fluid and in 
nasal and tracheal washings. The antibodies are thought to be produced by 
plasma cells in lymphoid tissue such as the Harderian gland. The production of 
both IgG and IgA has been reported (Holmes, 1977; Davelaar et al., 1982; 
Hawkes et al 1983; Lütticken et al 1988). Recent work has suggested that the 
amount of IgA produced in saliva and lachrymal fluid after infection may be 
related to the reduced pathogenicity of IBV for certain inbred lines of chickens 
(Cook et al., 1992). It is interesting to note that aerosol vaccination of day-old 
chicks results in protective immunity, presumably because local immune 
mechanisms have been stimulated (Davelaar and Kouwenhoven, 1980). 

Although the subject has received little attention, cell-mediated immune 
responses have been reported following IBV infection. Systemic cell-mediated 
immunity (CMI) has been detected using a lymphoblast transformation tech- 
nique (Timms et al., 1980). There was no direct correlation between CMI and 
humoral antibody production (Timms and Bracewell, 1981). CMI has been 
demonstrated to the S1, N, and M proteins of IBV (Ignjatovic and Galli, 1993). 
Although in that work only the S1 glycoprotein protected against challenge, 
Boots et al. (1992) demonstrated that a recombinant product expressing the N 
protein induced a CMI response and some protective immunity. 


E. Host Factors 


1. Age Resistance 


Although chickens of all ages are susceptible to infection, very young 
chicks tend to show more severe respiratory signs than older ones and mor- 
bidity is generally only observed in very young chicks. Smith et al. (1985) found 
that chicks rapidly became more resistant to a mixed IBV and E. coli infection 
with increasing age. Whereas approximately 90% mortality was recorded fol- 
lowing day-old inoculation, this fell to 2096 when inoculation was delayed until 
6 weeks. 


2. Genetic Resistance 


The availability of genetically defined, inbred lines of chicken has enabled 
genetic differences in the response to IBV infection to be studied. Purchase et al. 
(1966) demonstrated some variation in mortality following IBV inoculation of 
embryos from different inbred lines. Using a mixed IBV and E. coli infection, 
Smith et al. (1985) and Bumstead et al. (1989} have demonstrated large differ- 
ences in mortality, ranging from 3 to 87%, in different inbred and partially 
inbred lines of chicken. However, even in the more resistant lines clinical signs 
of respiratory infection were recorded and pericarditis and peritonitis observed 
in some of the chicks, indicating that the severity of the IBV infection was 
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sufficient to allow E. coli to invade, although not to cause a lethal infection. As 
a result of challenging with IBV or E. coli alone, Bumstead et al. (1989) con- 
cluded that the variation in mortality in different lines reflected differences in 
resistance to IBV rather than to E. coli. Examination of Fl crosses suggested that 
mortality differences could not be attributed to maternal effects and that inher- 
itance of resistance was fully dominant. The mortality pattern in F2 and back- 
cross generations was compatible with the inheritance of a dominant auto- 
somal resistance gene and showed no evidence of association with the major 
histocompatibility complex. 

During this study, it was found that up to 8796 mortality occurred in some 
inbred lines of chicken following intranasal inoculation of IBV alone. This 
enabled a comparison to be made of different parameters of an IBV infection in 
highly resistant (White Leghorn line C) and highly susceptible (White Leghorn 
line 151) chickens. It was found (Otsuki et al 1990; Nakamura et al 1991) that 
the respiratory infection and degree of damage to the trachea were more severe 
and longer lasting in the susceptible line. While similar amounts of virus were 
recovered initially from tissues of both lines, overall more was recovered and for 
longer from the susceptible line. However, tissues from each line were equally 
susceptible to in vitro inoculation of IBV, suggesting that resistance to IBV 
infection is not related to a lack of specific viral receptors by target cells of the 
resistant line, as is the case with murine hepatitis virus (Boyle et al 1987). This 
finding and the observation of some increased susceptibility to IB infection in 
bursectomized chicks (Cook et al., 1991a) suggests that some other mechanism 
must play a role in resistance to IBV infections. The nature of this mechanism 
has yet to be identified. 


F. Viral Factors 


The ability of IBV to vary antigenically has been known for many years, 
based mainly on the identification of different serological types using virus 
neutralizing tests (Hopkins, 1974; Darbyshire et al., 1979). As techniques for 
performing such tests have improved, it has became increasingly easy to define 
serotypes clearly. Many new serotypes have now been identified, particularly in 
Europe where most of such work has been done (Davelaar et al 1983; Cook, 
1984; Cook and Huggins, 1986}, but also in other countries, for example, Chile 
(Cubillos et al., 1991) and the United States (Gelb et al., 1991) (Table I). One 
reason for the increase in numbers of antigenic types of IBV may be the wide- 
spread use of live attenuated IB vaccines, leading to the preferential multiplica- 
tion of virus populations able to escape neutralization by vaccine-induced 
antibodies (Kusters et al., 1987). However, nucleotide sequencing has indicated 
that some IBV isolates are genetic recombinants. The possession by two se- 
rologically distinct IBV strains of very similar M but very distinct S1 proteins 
was suggested as evidence of recombination (Cavanagh and Davis, 1988). Kus- 
ters et al. (1989) compared S1 sequence data from several IBV isolates with T1 
fingerprints of the entire IBV genome and reached the same conclusion. Using 
nucleotide sequencing, Cavanagh et al. (1992a) showed that although the S 
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sequence of a Massachusetts serotype strain (322/82) was very similar to Massa- 
chusetts, the 3C and M gene sequences showed greater homology to those of 
European non-Massachusetts serotypes, thereby providing evidence of recom- 
bination between IBV strains. Similarly, Wang et al. (1993) have shown a US 
field isolate of IBV to be a recombination between the Massachusetts and 
Arkansas serotypes, the recombination apparently having occurred within SI. 

The finding of sharply defined serological differences between IBV strains, 
however, does not imply lack of in vivo cross-protection between them. Using 
as a challenge system a mixed IBV and E. coli infection, it has been shown (Cook 
et al., 1986) that IBV strains may protect against challenge with antigenically 
distinct isolates and that protection is much more broadly based than might be 
inferred from serological data. It may eventually be shown that the grouping of 
strains into antigenic groups by the use of, for example, monoclonal antibody 
(MAb)-based ELISAs (Ignjatovic et al., 1991) will correlate better with cross- 
protection data than do the results of virus neutralizing tests. 

It is known that serotype-specific VN antibodies are induced by the highly 
variable amino-terminal S1 subunit of the spike glycoprotein (Mockett et al., 
1984, Cavanagh et al., 1986; Karaca et al., 1992). A surprising finding from 
sequence analysis of S1 of several British isolates belonging to different se- 
rotypes was the similarity of their S1 sequence, the strains differing by only 2% 
of their S1 residues (Cavanagh et al., 1992b). This provides further evidence that 
only a few amino acids on S1 form epitopes responsible for inducing VN anti- 
body. Thus, a few key amino acid substitutions can define a new IBV serotype. 
Analysis of MAb-resistant mutants has shown that an area near the N-terminus 
of Sl is associated with neutralization epitopes (Cavanagh et al., 1988), and 
other data (Cavanagh et al., 1992b; Kant et al., 1992) suggest that the region of 
the first 300 N-terminal residues is an important hypervariable region. As has 
been shown with transmissible gastroenteritis virus (Jimenez et al., 1986], it is 
probable that the tertiary structure of the IBV spike protein is important in 
defining antigenic sites (Koch and Kant, 1990], some of which may be formed by 
juxtapositioning of linearly well separated amino acids. Recent work (Boots et 
al., 1991), using a mouse model, has suggested that antigenic variation may also 
occur in the nucleocapsid of IBV. 

The availability of MAbs (Mockett et al., 1984; Koch et al., 1990; Ignjatovic 
and McWaters, 1991) and the application of molecular techniques to epidemio- 
logical studies are permitting the evolution of IBV strains to be studied. It is 
already clear that the situation is complex; Cavanagh et al. (1988) showed that 
strains of the Massachusetts serotype isolated over many years have retained an 
S1 protein with over 94% homology, while strains with only 50% homology 
coexist (Kusters et al., 1989). It has also been shown (Cavanagh et al., 1992b], 
using neutralizing MAbs, that isolates belonging to different serotypes (defined 
using pooled, polyclonal chicken antibodies) can share common neutralizing 
antibody-inducing epitopes. Thus we have examples of S gene homology both 
within and between serotypes. Much work remains to be done to clarify IBV 
evolution, a subject further complicated by genetic recombination, but it ap- 
pears the IBV strains are evolving in different geographical areas by more than 
one line simultaneously (Cavanagh and Davis, 1992) but at varying rates, de- 
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pending on many factors such as poultry density, vaccination procedures, and 
so on. 

Progress in studying IBV evolution is now coming both from the applica- 
tion of MAb analysis and the use of the polymerase chain reaction (PCR), which 
has recently become a useful additional tool to study IBV. Lin et al.(1991a) 
identified a pair of oligonucleotides spaced 400 bases apart in a conserved region 
of S2. These should identify all IB viruses and indeed they have been used 
successfully to rapidly confirm a viral isolate as IBV (Parsons et al., 1992). 
Oligonucleotides have now been identified that enable the DNA of only S1 to be 
synthesized (Kwon et al 1993a). PCR products have also been taken, the DNA 
cut using appropriate restriction enzymes, and IBV strains typed more rapidly 
than by using conventional virus neutralizing tests (Lin et al 1991b; Kwon et 
al., 1993a). 


VI. DIAGNOSIS 
A. Clinical 


Avian infectious bronchitis infection may be suspected in a flock because 
of the clinical signs, which include respiratory signs, drops in egg production, or 
nephritis. However, other diseases such as Newcastle disease, infectious laryn- 
gotracheitis, and turkey rhinotracheitis can produce similar clinical signs. 
Hence, in order to diagnose an IBV infection it is important to isolate the virus 
and to demonstrate a specific antibody response. 


B. Virus Isolation 


There are a number of ways to propagate IBV, principally tracheal organ 
cultures, embryonated eggs, and tissue culture. Passage of IBV in embryonated 
eggs inoculated via the allantoic route may produce no effects on the embryo 
initially, but after two or three passages dwarfing or death of the embryo is 
usually observed and high titers of infectious virus can be present in the allan- 
toic fluid. Probably all IBV strains grow in tracheal organ cultures or embryo- 
nated eggs but adaption is required for growth in tissue culture. Hence, this 
substrate is not used for primary virus isolation. As mentioned above, swabs of 
nasal discharge collected in the early stages of the disease are a rich source of 
virus as, in the later stages, is fecal material. 

Infected allantoic fluid and tracheal organ culture supernatant fluid fre- 
quently contain enough virus particles to be visualised by electron microscopy. 
The typical size (about 80-100 nm) and shape (usually round with projecting 20- 
nm spikes) of the coronavirus (Berry et al., 1964) enables a presumptive diag- 
nosis to be made. However, the electron microscopy facility is usually found 
only in research establishments. 

The immunofluorescence test is useful in confirming the identity of isolates 
as IBV, and a technique for classification based on neutralization of immuno- 
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fluorescent foci (Csermelyi et al., 1988) has been used successfully. Polyclonal 
antisera, which react with antigenic determinants common to all IBV strains, 
were used and smears of infected chorioallantoic membranes provided the 
substrate (Clarke et al., 1972). 

Antigen assays, based on MAbs, are now becoming more widely used. A 
MAb-based immunoperoxidase procedure has been found to be highly sensitive 
for detecting IBV in infected tissues or chorioallantoic membranes of infected 
embryos (Naqi, 1990). Yagyu and Ohta (1990) used an indirect immunofluores- 
cence assay with a MAb specific for the IBV nucleocapsid protein and were able 
to detect IBV in tracheas of infected chickens for longer than was possible by 
virus isolation. Surprisingly, a streptavidin-biotin immunohistochemical as- 
say was less sensitive than virus isolation (Owen et al., 1991). 

By means of the PCR and a biotin-labeled DNA probe it has been possible to 
amplify and detect IBV in allantoic fluid (Jackwood et al., 1992], in tracheal 
swabs taken from chickens inoculated with different IBV serotypes, and even in 
allantoic fluid of embryos inoculated with field samples from IB outbreaks 
(Kwon et al., 1993b). These authors claim that this technique detects IBV more 
rapidly and efficiently than more conventional methods. 

With the availability of MAbs that are either group-specific or serotype- 
specific (Koch et al., 1990; Karaca et al., 1992), various ELISAs, such as antigen 
capture assays, have now been developed (Cavanagh et al., 1992b; Naqi et al., 
1993) that allow detection of either all IBVs or particular serotypes. However, 
the sensitivity of such assays needs improvement (Nagi et al., 1993). The large 
panel of MAbs now available in The Netherlands has enabled extensive charac- 
terization of existing IB strains to be carried out (Cavanagh et al., 19925). 


C. Serological Methods 


A range of serological tests is available to detect IBV antibodies, including 
agar gel precipitation test, immunofluorescence, hemagglutination inhibition, 
ELISA, and virus neutralization. The agar gel precipitation test requires the 
presence of high concentrations of specific antibodies (precipitins) in order to 
produce a precipitin line; thus the test is relatively insensitive. Precipitins are 
only present for a short period after infection; MacDonald et al. (1982) suggested 
that their presence indicates recent exposure to the virus. The immunofluores- 
cence test is a useful, relatively sensitive assay, but it has not been used 
extensively as a serological test for IBV. 

The hemagglutination inhibition test gained favor when it was found that 
IBV could be made to hemagglutinate chicken red blood cells following treat- 
ment with crude preparations of the enzyme phospholipase C (Bingham et al., 
1975), although more recently neuraminidase has been shown to be the active 
constituent (Schultze et al., 1992). Interestingly, Davelaar et al. (1983) have 
shown that certain IBV strains may spontaneously hemagglutinate chicken red 
blood cells. The hemagglutination inhibition test detects antibodies as early as 
7 days after infection, and these antibodies persist for some time. This test is 
used routinely for flock monitoring and in most research or diagnostic laborato- 
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ries. However, considerable care is required in the performance and interpreta- 
tion of the test (Cook et al., 1987], which should always be carried out under 
carefully controlled conditions such as those described by Alexander et al. 
(1983). 

In some laboratories, the ELISA has gained acceptance. The test is very 
sensitive (Garcia and Bankowski, 1981) and can detect antibodies as early as 3 
days after infection (Mockett and Darbyshire, 1981). ELISAs that use purified 
virus (from isopycnic sucrose gradients) as antigen produce highly specific 
results. As with the hemagglutination inhibition test, antibodies reach highest 
concentrations 14 to 17 days after infection, after which there is a gradual 
decline. It is probable that the ELISA will become the serological test of choice 
for monitoring responses to vaccination since it is especially useful for the 
large-scale screening of poultry flocks. However, neither the ELISA nor the 
hemagglutination inhibition test identify a particular serotype of IBV. For this, 
the virus neutralizing test performed in embryonated eggs (King and Cavanagh, 
1991), cell culture after adaptation of the virus to that system (Hopkins, 1974), or 
tracheal organ cultures (Darbyshire et al., 1979) was until recently the only 
available assay. However, the existence of strain-specific MAbs has led to the 
recent development of serotype-specific ELISAs [Karaca and Naqi, 1993). It 
seems likely that the use of such techniques will increase in the future. 

Since a humoral antibody response is produced following vaccination, it is 
frequently necessary to be able to differentiate between the response to vaccina- 
tion and to field challenge. An infection might be indicated by a rise in antibody 
titer to well above the anticipated vaccinal response level. In order to demon- 
strate evidence of a field challenge, it is necessary to take sequential sets of 
serum samples and demonstrate a rise in antibody titer between them. 


VII. CONTROL 
A. Vaccines 


In most situations the only practical means of preventing IB is to vaccinate 
against the infection, and both live attenuated and inactivated vaccines are 
available. Live attenuated vaccines may be given by eyedrop, aerosol, or in the 
drinking water and are effective in the face of maternally derived immunity 
(Cook et al., 1991b). With live vaccines it may be possible to prevent disease but 
not prevent infection. The more highly passaged virus strains, which have 
reduced pathogenicity, also have reduced immunogenicity. Furthermore, live 
vaccines may only induce short-lived immunity, and hence repeated vaccina- 
tions may be required. It is advisable to monitor regularly the specific antibody 
status of the flock. 

Initially, most of the virulent IBV strains isolated were of the Massa- 
chusetts type. Indeed, it was thought for many years that there was only one 
serotype of IBV. However, other serologically distinct serotypes were subse- 
quently isolated: Connecticut (Jungherr et al., 1956) followed by others such as 
Arkansas (Johnson et al., 1973) or 06 (Gelb et al., 1991) in the United States; 
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D212 and D274 in Holland (Davelaar et al 1981); A to J in England (Cook and 
Ellis, 1988); PL84084 in France (Picault et al., 1986), Az-23/74 in Italy (Zanella, 
1988) and G in Morocco (El Houadfi et al 1986) (Table I). Different IBV sero- 
types appear to be present in distinct geographical areas. Thus the Arkansas 
serotype has not been isolated in Europe, while the D274 serotype has not been 
isolated in the United States. Even in particular countries distinct serotypes 
may be confined to local areas although more than one serotype can coexist in 
the same area (Gelb et al., 1991). 

Live vaccines have been developed to combat some of these serotypes, 
although it is clear that a new IB vaccine is not required for every new serotype 
since protection is more broadly based than serological tests might suggest 
(Cook et al., 1986). From time to time, however, new serotypes emerge against 
which existing vaccines are not fully effective (Picault et al., 1986; Cubillos et 
al., 1991; Parsons et al., 1992; Lambrechts et al., 1993; Pensaert and Lambrecht, 
1994); only in those areas where a known serotype regularly causes disease 
should a live vaccine containing that serotype be recommended. 

It has been assumed that high concentrations of specific IgG antibody can 
protect the kidney and the reproductive tract from the effects of IBV. The aim 
therefore is to induce high concentrations of such antibody. This can be 
achieved by the use of an inactivated IB vaccine given as an oil-in-water emul- 
sion to chickens that have already been “primed” by vaccination with a live- 
attenuated IB vaccine. The slow release of antigen stimulates and maintains the 
specific immune response for a considerable time. The timing of administra- 
tion of inactivated vaccine is important. Chickens that have high concentra- 
tions of circulating antibody do not respond well to inactivated vaccine, pos- 
sibly because the antibody reacts with the antigen and prevents stimulation of 
the B-cell response. If the correct time interval is allowed, however, a chicken 
already primed with a live IB vaccine produces a vigorous B-cell response to the 
inactivated vaccine. 


B. Management of Environmental Factors 


Avian infectious bronchitis is a very labile but highly infectious virus. 
Thus, commonly used disinfectants can kill the virus, and material containing 
IBV is rapidly made noninfectious usually within 24 hr at room temperature. 
However, because IBV is readily spread by fomites and is airborne, careful 
procedures must be undertaken to keep premises clean, for example, by provid- 
ing foot baths and restricted access to the premises. 

The large numbers of chickens commonly housed together provide ideal 
conditions for the spread of IBV, and since the virus is highly infectious it 
spreads rapidly from chicken to chicken in the aerosols created by the coughing 
and sneezing of infected birds. Management should avoid any factors that might 
irritate the chicken’s respiratory tract, for example, by preventing the build up 
of ammonia and ensuring adequate ventilation in poultry houses. It cannot be 
stressed too strongly that good vaccination programs go hand in hand with good 
management. 
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CHAPTER 16 


Molecular Basis of 
Transmissible Gastroenteritis 
Virus Epidemiology 


Luis ENJUANES AND BERNARD A. M. VAN DER ZEIJST 


I. HISTORY OF TRANSMISSIBLE GASTROENTERITIS AND 
CLOSELY RELATED CORONAVIRUSES 


A disease with the characteristics of transmissible gastroenteritis (TGE) was 
first reported in 1935 (Smith, 1956). The viral etiology of TGE was demonstrated 
11 years later by Doyle and Hutchings (1946) in the United States. During the 
next 20 years TGE was reported in all other continents (Table I). Apparently, the 
disease occurred first in those countries that had imported North American 
stock and then was also introduced by European stock (Woode, 1969). There is 
evidence that TGE was not new to the pig, but became important to the pig 
industry concurrently with its intensification (Woode, 1969). 

Until two decades ago, TGE outbreaks in the United States and Europe 
occurred mainly as acute epizooties on breeding farms, particularly during the 
winter months. The disease incidence appeared to have a cyclic course. After an 
outbreak, the disease and the virus disappeared and the herd immunity gradu- 
ally waned in the next 2 to 3 years. A new outbreak was the consequence of the 
reintroduction of transmissible gastroenteritis virus (TGEV). A change in this 
epizootic pattern has occurred in recent decades together with the further 
intensification of swine breeding. TGEV has become more enzootic, partic- 
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TABLE I. Initial Reports of Transmissible Gastroenteritis 
in Various Countries 


Year Country or Continent References 

1946 United States Doyle and Hutchings (1946); R. Wesley 
(personal communication) 

1956 England Pritchard (1987) 

1956 Japan Sasahara et al. (1958) 

1956 France Jestin et al. (1987a,b) 

1957 Denmark Woode (1969) 

1958 China Woode (1969) 

1959 Germany Woode (1969) 

1960 Spain Concellón Martínez (1960) 

1961 Russia Woode (1969) 

1961 Poland Woode (1969) 

1962 Romania Woode (1969) 

1962 The Netherlands Woode (1969) 

1966 Australia Woode (1969); not confirmed 

1966 Africa Woode (1969); not confirmed 

1966 Czechoslovakia Woode (1969) 

1967 Canada Woode (1969) 

1967 Belgium Woode (1969) 


ularly in large breeding farms in which, after a primary outbreak, the virus 
persists in infected weaned nursery pigs. This enzootic form of TGE cannot be 
recognized clinically and it has, therefore, become difficult to determine how 
widespread TGEV is in a country, unless serological surveys were performed 
(Pensaert et al., 1993; Hill, 1989). 

At present, TGE has dominantly been reported in developed areas, which 
could be caused by a higher incidence in these countries, but could also reflect 
that the disease is not an obligatory noticeable disease in developing countries 
(FAO, 1984). In Europe, the prevalence of TGE is decreasing. However, there are 
still pockets with a high incidence of TGE. For instance, in England only 3% of 
the national herd was seropositive for TGEV up to 1984 (Pritchard, 1987], but in 
East Anglia TGEV has been diagnosed virtually every year since 1956 (Pritchard, 
1987), the last serious epizootic occurred from 1980 to 1982 (Pritchard and 
Cartwright, 1982; D. Paton, Central Veterinary Laboratory, UK, personal com- 
munication). Evidence of enzootic TGEV infection was found in 50% of pigs 
studied between 1981 and 1983 in East Anglia. The region, therefore, is a 
reservoir containing the virus. In general, TGEV outbreaks have diminished, 
but the virus is still present in Central Europe. In Belgium the number of 
seropositive fattening pigs has decreased from 15% (Callebaut et al., 1989) to 
7.696, according to a seroepizootiological study carried out between 1989 and 
1990. In this study 596 of the farms appeared to harbor seropositive animals 
(Pensaert et al., 1992). In South European countries such as Spain, the incidence 
of TGE is low. Two geographical zones, Central and East Spain, can be differen- 
tiated. In Central Spain (Castilla and León], sera collected in 1985 and 1988 were 
seronegative for TGEV (Rubio et al., 1987; Lanza et al., 1993a) and the virus was 
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only occasionally diagnosed, generally associated with swine imports (Laviada 
et al., 1988; Lanza et al., 1990). Along the Mediterranean Coast, epizootic 
outbreaks of TGE-like disease have been observed every 3 years since 1980 
(Plana et al 1982; Anon, 1989). TGEV-specific antibodies have been found in 
sows of two different intensive pig-breeding areas: Murcia and Catalonia. A 
survey made in 1987 in the East Coast showed that 1.3% of the breeding pigs 
were seropositive and there was a prevalence of infection in 5% of the farms 
(Cubero et al., 1990, 1993b). 

In the United States, TGE is more of a problem. In 1987 and 1988, TGE was 
reported in about half of the swine herds (Hill, 1989). It was also a major cause of 
viral enteritis, since 2696 of the cases with neonatal diarrhea submitted to 
diagnostic laboratories were caused by TGEV, a figure equal to the enteritis 
caused by Escherichia coli (Hoefling, 1989). In the United States, TGEV has 
created greater economic vulnerability resulting from growth retardation and 
increased susceptibility to other infectious diseases (Hoefling, 1989). But re- 
cently the incidence of TGEV has also decreased in the United States. The 
percentage of farms with at least one sample testing positive for TGEV was 
35.8% [National Animal Health Monitoring System (NAHMS|, 1992], lower 
than the 5096 found in a survey of Midwestern swine in the early 1980s (Egan et 
al., 1982]. In the United States, 24.3% of the farms still vaccinate the sows or 
gilts against TGE (NAHMS, 1992]. 

Recently, a variant of TGEV, the porcine respiratory coronavirus (PRCV}, 
has been described (Pensaert et al., 1986; Callebaut et al., 1988). The discovery 
of this virus was based on a survey carried out in 1984 in Belgium, which 
showed an increase of animals with antibodies to TGEV (up to 68%), with no 
increase in the incidence of TGE in the preceeding winter, and in the absence of 
vaccination (Pensaert et al., 1987). In 1986 and 1987, the virus spread to 10096 of 
the swine farms in Belgium, 6796 of which are still infected. PRCV-free farms 
became infected during the autumn (Pensaert et al., 1992). This pattern, show- 
ing higher incidence of PRCV in the cold seasons, has repeated itself in later 
years both in Belgium and in France (Laude et al 1993). The virus has been 
observed in many European countries (The Netherlands, Denmark, England, 
Spain, and France], where it has now spread to almost the whole swine popula- 
tion (Brown and Cartwright, 1986; Pensaert et al., 1987; Jestin et al., 1987a,b; 
Laviada et al., 1988; Have, 1991; Sánchez et al., 1990). PRCV was detected in 
Spain for the first time in September 1986 (Martin-Alonso et al., 1992). Thirty- 
one percent of sera collected in Castilla-León (Central Spain) and 6496 of the 
farms in this area were seropositive for PRCV (Lanza et al., 1993a,b). In England, 
a survey in 1987 of over 300 elite herds with a high health status revealed that 
80.1% of the herds contained seropositive pigs (D. Paton, personal communica- 
tion}. In the United States, serological tests detected PRCV for the first time in 
Indiana in 1989 (Wesley et al., 1990). Since then, other PRCV isolates have been 
reported in the United States and Canada (Halbur et al., 1993; Jabrane and 
Elazhary, 1993). PRCV strongly cross-reacts with its ancestor TGEV (Callebaut 
et al., 1988; Rasschaert et al., 1990; Sánchez et al., 1990, 1992; Wesley et al., 
19913). PRCV may evoke a protective response to TGEV (Hooyberghs et al., 
1988; Paton and Brown, 1990; De Diego et al., 1992; Laude et al., 1993; Wesley 
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and Woods, 1993). PRCV has also been diagnosed in Asia and Eastern European 
countries (M. Pensaert and J. Musilova, personal communications]. 

A second enteric coronavirus, the porcine epidemic diarrhea virus (PEDV}, 
serologically unrelated to TGEV, was isolated for the first time in Belgium in 
1977 (Pensaert and Debouck, 1978) and then in England (Chasey and Cart- 
wright, 1978) and Spain (Jiménez et al., 1986a). PEDV is now present in all 
European countries. In 1983, two surveys performed in England, in the spring 
and in the autumn, showed 19.2 and 8.3% of the animals positive, respectively, 
which reflects the seasonal incidence of PED. In 1987, 9.2% of the sows tested 
were seropositive for PED in England. Currently, about 1-2% of the fecal 
samples submitted to the Central Veterinary Laboratory from all over England 
are PED positive. PEDV has not been found in the United States (R. Wesley, 
National Animal Disease Center, Iowa, personal communication}. It is intrigu- 
ing that during 35 years of research into TGE, PEDV had not been recognized 
before, suggesting that the infection is a recent event or that in the past its 
diagnosis was less efficient because TGEV-specific reagents were used to detect 
coronaviruses-causing enteritis. 


Il. SUSCEPTIBILITY TO TGEV INFECTION 
A. Tropism and Host Cells 


TGEV is basically an enteropathogen, although natural TGEV isolates can 
also replicate in the respiratory tract (Underdahl et al., 1974; Kemeny et al., 
1975). The virus has occasionally been isolated from tonsils, trachea, and lungs. 
Signs of respiratory disorders were not observed despite the detection of pul- 
monary lesions (Kemeny et al., 1975; Underdahl et al., 1974, 1975). During 
infections with virulent isolates, the highest virus concentrations are found in 
the enteric tract. The jejunum and, to a lesser extent, the ileum and duodenal 
epithelium are the areas with most pathology. TGEV isolates that have been 
passaged in cell cultures gradually lose their tropism for the enteric tract, while 
they gain tropism for respiratory tissues (Harada et al., 1969; Furuuchi et al., 
1978, 1979) and for alveolar macrophages, where TGEV replication is partially 
restricted (Laude et al., 1984). Highly attenuated strains of TGEV, which repli- 
cate in the upper respiratory tract, including tonsils and lungs, do not replicate 
in the intestine of newborn pigs (Furuuchi et al., 1979]. TGEV replicates in the 
apical tubovascular system of villous absorptive cells in newborn pigs; this 
system is absent in pigs older than 3 weeks (Wagner et al., 1973). The virus also 
grows in the mammary tissue of lactating sows (Saif and Bohl, 1983); infected 
sows shed virus in their milk (Kemeny and Woods, 1977). Intrafetal inoculation 
results in the production of villous atrophy (Redman et al., 1978). The suscep- 
tibility of cells from different organs to TGEV should be carefully defined for 
each situation, since it is a function of the virus dose, the status of the swine 
used (gnotobiotic, colostrum deprived, or conventional], temperature, and age 
of the animal, among other factors (Furuuchi et al., 1976; L. Saif, personal 
communication; M. L. Ballesteros and L. Enjuanes, unpublished results). 
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In infected animals, the virus can be recovered from macrophages and other 
cells of the reticuloendothelial system (Underdahl et al., 1974). Macrophages 
from the intestinal mucosa (Chu et al., 1982) and probably Küppfer cells are also 
infected, while porcine blood monocytes are not infected by TGEV (Laude et al., 
1984). The virus has also been isolated from mesenteric lymph nodes and 
Peyer's patches up to 9 days after infection. However, the infectious virus 
persisted only for 3-4 days in T-cell lines derived from these organs (M. J. 
Bullido and L. Enjuanes, unpublished results). 

Even though TGEV usually causes an acute enteric disease, it also leads to 
persistent infections, in which the virus is not detected in the enteric tract (M. 
Pensaert, personal communication). Instead, the virus has been found in the 
respiratory tract of recovered pigs for more than 100 days after infection (Under. 
dahl et al 1974, 1975). TGEV mutants can persistently infect adult swine with 
continuous virus shedding in the gut (R. Wesley and R. Roods, personal commu- 
nication]. 

After oronasal inoculation, strain BEL85 of PRCV replicates in the nasal 
mucosa, tonsils, trachea, bronchi, bronchioles, and alveoli and alveolar macro- 
phages, while in the intestine a few cells stained positively with immuno- 
fluorescence. Higher virus titers (>108 TCID,, jg Of tissue) have been detected in 
the lungs (O’Toole et al., 1989; Cox et al., 1990a; Laude et al., 1993). Other 
tissues, including plasma (viremia was observed), mesenteric lymph nodes, and 
colon were consistently positive, while the virus was sporadically isolated from 
other lymph nodes, spleen, liver, and thymus. Even when the virus was inocu- 
lated directly to the lumen of the small intestine, a limited degree of virus 
replication, in a few enterocytes located in the transition from the crypts to the 
villi in the jejunum, was observed (Pensaert et al., 1987). The cells were identi- 
fied as villous enterocytes by electron microscopy (Popischil et al., 1990) and 
immunocytochemistry (O’Toole et al., 1989). PRCV is not found in rectal swab 
samples (Vancott et al., 1993) unless it is administered to the pig artificially via a 
stomach tube (Wesley and Woods, 1993). A minimum of 10? TCID,, of PRCV 
were needed to start the infection in the intestinal tract (Cox et al., 1990b]. This 
result indicates that the gut is not the target organ for PRCV and that the ability 
of these coronaviruses to infect a tissue is not an all-or-nothing phenomenon. 

It has been debated whether PRCV causes respiratory disease or not. Signs 
of respiratory disorders caused by PRCV have not been observed by Pensaert's 
group, either after field infections or after experimental aerosol inoculation, 
using procedures that produce clinical signs using respiratory viruses such as 
influenza viruses and Aujeszky disease virus (Pensaert et al., 1987). Also, exper- 
imental inoculation of pigs with the British PRCV has not produced disease, 
other than very occasional and mild rhinitis, although histopathological lesions 
are present (O'Toole et al., 1989). In England, isolation of PRCV has been 
associated with respiratory problems, frequently with concurrent swine influ- 
enza (Lanza et al., 1992). In contrast, two groups (Duret et al., 1988; Van 
Nieuwstadt and Pol, 1989) have reported that PRCV can cause pneumonia. The 
pathology caused by various PRCV strains isolated in the United States also 
range from inapparent to severe bronchointerstitial pneumonia, which has been 
observed more recently in gnotobiotic pigs. Lesions in conventional pigs were 
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less severe, however. The pigs developed mild disease and fever (Halbur et al., 
1992). The Canadian isolate PRCV IQ90 produced morbidity and mortality 
rates reaching 100% and 60%, respectively (Jabrane and Elazhary, 1993). Con- 
flicting results have been obtained with different PRCV isolates which differ in 
their genetic structure (see Section IILA.). This may explain the different re- 
sults. 


B. Virus Receptors 


Two cell surface proteins appeared to be relevant to the entry of TGEV into 
susceptible cells: aminopeptidase N (APN) and a recently described 200-kDa 
protein. APN, an N-terminal exopeptidase with preference for neutral amino 
acids, has been clearly shown to act as a major receptor for TGEV and PRCV in 
cultured cells (Delmas et al., 1992, 1993). APN is identical to CD13, a surface 
protein abundantly expressed in the brush border membranes of intestinal 
epithelial cells and fibroblast, on the apical surfaces of lung and renal cells, as 
well as in granulocytes, monocytes, and their bone marrow progenitors (Norén 
et al., 1986; Look et al., 1989). APN plays a role in the digestion of peptides in the 
gut (Kenny et al., 1987). The distribution of APN strongly suggests that it is also 
major receptor for TGEV and PRCV in vivo. Two key experiments provide 
evidence for the involvement of APN as a receptor for TGEV and PRCV. First, 
cells refractory to these viruses became susceptible after being transfected with 
the cDNA encoding porcine APN. Second, monoclonal antibodies (MAbs) spe- 
cific for this protein efficiently block the multiplication of these viruses in cell 
culture. APN also is the major receptor for human coronavirus 229E (Yeager et 
al., 1992) and for cytomegalovirus (Söderberg et al., 1993). 

Additional factors are probably involved in the susceptibility of intestinal 
epithelial cells both for TGEV and PRCV. Since intestinal cells are not infected 
(or only infected with extremely low efficiency) by PRCV, the existence of an 
additional factor involved in virus entry or in later steps of replication has been 
proposed (Sánchez et al., 1992; Delmas et al., 1993; Laude et al., 1993]. A similar 
situation exists in murine coronaviruses (Yokomori et al., 1993). On the viral 
side, there is evidence that two regions of the genome might be involved: the 
5' terminus of the S gene is deleted or altered in viruses that have lost enteric 
tropism (Sánchez et al., 1992; Delmas et al., 1993; Wesley et al., 1991a; Britton et 
al., 1991), alternatively, the open reading frame (ORF) 3a that codes for a 71-72 
amino acid nonstructural polypeptide (Godet et al., 1992), which has been 
converted to a pseudo-gene in many respiratory virus (Rasschaert et al., 1990; 
Wesley et al., 1989; Laude et al., 1993), might also be a determinant of tropism 
and virulence for coronaviruses related to TGEV. It was suggested that the high 
susceptibility of newborn piglets to TGEV infection and the tropism of the 
virus for villous enterocytes may be related to a 200-kDa protein. Experimental 
evidence for the existence of a second receptor binding site for TGEV has 
recently been provided (Weingartl and Derbyshire, 1993a,b). A saturable, spe- 
cific binding of TGEV to the plasma membrane of the villous enterocytes in 
neonate swine has been shown. This binding is inhibited by MAbs that recog- 
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nize a 200-kDa protein but not APN (Weingartl and Derbyshire, 1993a,b; 1994). 
This protein was present in tiny or undetectable amounts in cryptal enterocytes 
of newborn swine or villous enterocytes from weaned pigs, less susceptible to 
TGEV. At early times post-birth (less than 3-day old piglets) the villi are covered 
with APN containing epithelial cells, while the 200-kDa protein is only present 
on the tip of the villi and in a few epithelial cells of the villi. 


C. Effect of Age on Infection 


A clear relationship between TGEV pathogenicity and the age of the in- 
fected animals has been established. It has been shown that the dose of virus 
needed to infect an adult swine is 104-fold greater than that required to infect a 
neonate (Witte and Walther, 1976). Epizootic outbreaks affect all age groups and 
spread through the herd in 2 or 3 days. Only pigs up to 3 weeks of age often 
vomit and develop watery diarrhea that leads to dehydration, rapid weight loss, 
and death in 2 to 5 days. Mortality in these young pigs often approaches 100%. 
In weanlings (3 to 8 weeks old), mortality is usually less than 10 to 20%, but 
impaired feed adsorption causes growth retardation. Adult pigs become inappe- 
tent and develop diarrhea that usually lasts only 2 to 4 days. Although mor- 
bidity may approach 10096, mortality in this age group is usually less than 596 
(Hill, 1989). 

A sharp difference in the susceptibility of newborn swine to TGEV, depend- 
ing on their access to colostrum and milk, even when the sows were seronega- 
tive for TGEV, has been observed (Furuuchi et al., 1976; M. L. Ballesteros, C. M. 
Sánchez, J. Plana, and L. Enjuanes, unpublished results). These data suggest a 
specific effect of the colostrum on intestinal epithelial cell differentiation 
toward a TGEV-resistant state. This colostrum factor may influence the expres- 
sion of the 200-kDa or APN TGEV receptors. Alternatively, factors present in 
the colostrum may interfere with binding of the virus to the receptors. Conven- 
tional colostrum-deprived newborn swine were fully susceptible (10096 mortal- 
ity) to high cell passage TGEV (PUR46-MAD strain), while 86% of their litter- 
mates that had access to colostrum and milk for 7 hr showed no disease 
symptoms (M. L. Ballesteros, C. M. Sánchez, J. Plana, and L. Enjuanes, unpub- 
lished results). Similar results have been obtained with gnotobiotic pigs and 
newborn animals that had access to colostrum, using attenuated strains of 
TGEV (M. Welter, personal communication). Thus, susceptibility to TGEV is a 
function of several host factors: age of animals, access to colostrum, and envi- 
ronmental conditions, in addition to the dose and virulence of the virus strain. 


D. Transmission of the Virus 


The main reservoir for TGEV and related viruses is probably the pig. Both 
fattening swine and lactating sows are responsible for transmission of TGEV to 
young animals. Contamination originates from feces, the milk, and aerosols 
generated in the respiratory tract (Kemeny et al., 1975; Kemeny and Woods, 
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1977, Kemeny, 1978). TGEV can also replicate in dogs, cats, foxes, starlings, 
domestic flies, and possibly in skunks, opossums, and muskrats (Saif and 
Wesley, 1992). No signs of infection have been detected in mice, rats, gerbils, or 
guinea pigs (Garwes, 1982). 

TGEV replicates in the mammary gland of lactating sows, and infectious 
virus can be recovered from their milk and possibly be transmitted to piglets 
(Saif and Bohl, 1983). Intrauterine transfer of PRCV and TGEV has not been 
observed (Paton and Brown, 1990; Saif and Bohl, 1983). Transmission by the 
aerogenic route in TGEV epizootiology is under debate. Some authors suggest 
that it is relevant since infectious virus has been isolated 100 days after infec- 
tion in the breath of infected pigs (Underdahl et al., 1975; Torres-Medina, 1975). 
Other authors suggest that the aerogenic route is not important for TGEV, since 
the efficiency of spreading by this route was much lower than that for PRCV 
(Laude et al., 1993). By contrast, PRCV spreads via air, based on the following 
findings: (1) aerosolized virus initiates the infection; (2) the virus disseminates 
very rapidly; (3) the virus transferred between farms with no apparent links; and 
(4) the virus spreads in countries with high hygienic standards, free of TGEV 
(Pensaert et al., 1986; Jestin et al., 1987a,b; Henningsen et al., 1988; Have, 1991). 
PRCV is secreted orally nasally. There is no indication that the fecal-oral 
transmission plays a role in the epizootiology of the natural infection. The 
transmission of PEDV in infected farms in the center of Spain could not be 
easily explained, other than by the aerogenic route (Jiménez et al., 1986a). 


III. ANTIGENIC AND GENETIC VARIATION 
A. Genome Organization and Virus Structure 


TGEV contains a single-stranded positive-sense RNA genome of around 30 
kb, which is infectious (Brian et al., 1980], and generates eight mRNAs, includ- 
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FIGURE 1. Comparison of the genomes of TGEV and related coronaviruses. ORFs are indicated. 
CCV ORF 3x is most likely not expressed. ORF 3 of FIPV is not well defined, although a 5.2-kb 
mRNA corresponding to this ORF has been described (de Groot et al., 1987; Vennema et al., 1992). 
Solid rectangles indicate identified or predicted ORFs; dotted rectangles indicate pseudogenes for 
which no mRNA has been detected; A, indicates presence of deletions; UTR, untranslated regions; 
An, poly A. The sequences used have been previously reported: TGEV-PUR46 [Rasschaert et al., 
1987), TGEV-MIL65 (Wesley et al. 1989; Wesley, 1990); PRCV-FRAS6 (Rasschaert et al., 1990), CCV 
(Horsburgh et al., 1992); FIPV (de Groot et al., 1987; Jacobs et al 1987; Vennema et al., 1992). The 
drawing is not to scale. 


ing the genomic size mRNA (mRNA 1] and a nested set of seven subgenomic 
mRNAs. The mRNAs have a common leader of about 90 nucleotides (nt)-long 
(Page et al., 1990; Sethna et al., 1989) and poly A on the 3’ end (Jacobs et al., 
1986). These RNAs are named mRNAs 2 or S, 3, 3-1, 4 or sM, 5 or M, 6 or N, and 
7, after the ORFs that are encoded by them (Fig. 1). The nomenclature used is 
according to the Coronavirus Study Group (Cavanagh et al., 1990, 1994). These 
genes code for four structural proteins (S, sM, M, and N) (Jacobs et al., 1986; 
Rasschaert et al., 1987; Godet et al., 1992). mRNA 7 codes for a potential fifth 
membrane-associated structural protein of 9 kDa (Tung et al 1992). A compari- 
son of TGEV genome organization with that of closely related coronaviruses is 
shown (Fig. 1). TGEV, PRCV, canine coronavirus [(CCV|, and feline infectious 
peritonitis virus (FIPV) have a similar genome organization with some differ- 
ences. FIPV and CCV have two ORFs at the 3’ end of the genome, instead of the 
single ORF7 in TGEV. 

The most abundant structural proteins in TGEV are S, M, and N. S protein 
is the main inducer of neutralizing antibodies (Laude et al., 1986; Jiménez et al 
1986b). Four antigenic sites (A, B, C, and D] have been recognized on the spike 
protein; site A can be subdivided in three subsites (Aa, Ab, and Ac) (Correa et al., 
1988, 1990). These sites have been mapped on the S protein (Fig. 2) in the order 
C, B, D, and A, starting at the N-terminal end. Site C is located between amino 
acid 49 to 52; site B is between residues 97 to 144; site D between residues 382 to 
389; and site A, around residues 538 (Aal, 543 (Ac), 586 [Aa-Ab], and 591 (Ab) 
(Correa et al., 1990; Enjuanes et al., 1990; Gebauer et al. 1991). Three of these 
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sites (B, D, and A} overlap with sites defined by Delmas et al. (1990a,b}, who 
named them D, C, and A.B, respectively (Fig. 2). By competitive binding studies 
using MAbs from different laboratories, more than ten antigenic sites could be 
differentiated, five of which were involved in the neutralization of the virus, 
although site A was the major inducer of neutralizing antibodies in all laborato- 
ries (D. Garwes, H. Laude, R. Wesley, R. Woods, and L. Enjuanes, unpublished 
data). The second important sites involved in in vitro neutralization are sites 
D and B. Only some of the MAbs specific for these sites are neutralizing (Laude 
et al., 1986; Jiménez et al., 1986b; Correa et al., 1988; Posthumus et al., 1990a,b; 
Van Nieuwstadt et al., 1988). Using neutralizing MAbs, Hohdatsu et al. (1987) 
have defined at least six different epitopes involved in the neutralization of 
TGEV isolates, which these authors classified into four groups. Interestingly, 
some critical epitopes for neutralization by MAbs appeared at high (176th!) 
passage levels but were not present at a low (17th) passage of the Toyama strain 
of TGEV. 


B. Evolution of TGEV Structural Proteins S, M, N, and sM 


The binding of MAbs to TGEV, PRCV, and related coronaviruses demon- 
strated antigenic diversity in the three major viral proteins (Fig. 3). Among 
TGEV isolates, there is more diversity in the S protein than in the M or N pro- 
tein. This might be related to the important biological activities of the corona- 
virus S protein. It is involved in the fusion of infected cells (Frana et al 1985; 
Fazakerley et al., 1992]; it binds to receptors on the cell surface (Holmes et al., 
1981, 1989; Delmas et al., 1992, 1993), it is involved in the pathogenesis of 
murine hepatitis virus (MHV) (Fleming et al., 1986); and it is the major inducer 
of complement-independent neutralizing antibodies (Jiménez et al., 1986b; Del- 
mas et al., 1986). Of the four antigenic sites defined (C, B, D, and A) (Correa et 
al., 1988, 1990; Delmas et al., 1986; Gebauer et al., 1991), sites C and B were 
particularly variable. Extending the comparison to PRCV, FIPV, feline enteric 
coronavirus (FECV) and CCV, these isolates also showed a wide variation in the 
sites C and B. Site D has an intermediate level of conservation, while site A was 
most conserved, particularly subsite Ac, which is present in porcine, feline, and 
canine coronaviruses (Fig. 3). From an antigenic point of view, the main differ- 
ence observed between the attenuated PRCVs and the virulent TGEVs is the 
absence of the B site on the virulent isolates: MIL65, SHI56, and MAD88 (Fig. 3) 
(Sánchez et al., 1990], indicating that this site is different in the virulent strains. 

Comparison of S protein sequences of TGEVs and PRCVs (Fig. 4) has shown 
that the S protein from three TGEV strains, MIL65, BRI70-FS772, and TOY56, 
has 1449 amino acids, two more residues than three clones of the high passage 
PUR46 strain and two vaccine strains (Sánchez et al. 1990; Register and Wesley, 
1994; C. M. Sanchez and L. Enjuanes, unpublished results). The two amino acid 
deletion is not present in the European PRCVs. An identity of 97% between the 
S-proteins of MIL65 and the PUR46 strains was observed at both amino acid and 
nucleotide level. Seventy-two percent of the amino acid changes were located 
within the N-terminal half of the S protein, which comprises the exposed 
globular-shaped portion of the peplomer. Only nine residue differences occur 
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FIGURE 3. Binding of MAbs to coronaviruses. The value of the MAb binding to the PUR46-CC120- 
MAD strain, determined by RIA, was taken as the reference value (100). The characteristics of the 
viruses used and the specificity of the MAbs have been reported (Sánchez et al., 1990). The antigenic 
homology of each virus isolate relative to the reference virus PUR46-CC120-MAD was expressed as 
a percentage (Sánchez et al., 1990}. Symbols: C, 0 to 30; Zi, 31 to 50; W, 51 to 100. The antivirus sera 
were TGEV specific in the case of TGEV, PRCV, FIPV, FECV, and CCV and specific for the 
homologous virus in the case of PEDV, HEV, HCV 229E, and MHV. ND, not determined. 


within the C-terminal half of the peplomer, which encompasses the stalk 
structure and the membrane anchoring domain. The N-terminal amino acid 
variation in the S-protein has also been detected in MHV and avian infectious 
bronchitis virus (IBV) (Luytjes et al., 1987; Kusters et al. 1989). In this region (S1) 
of IBV virus, there are two areas of high amino acid variability (Niesters et al. 
1986; Cavanagh et al., 1988). No clustering of amino acid changes indicative of 
highly variable domains were apparent in the S protein of TGEV. The amino 
acid changes in the S protein of MIL65 and PUR46 strains of TGEV are appar- 
ently substitutions that do not affect epitopes involved in in vitro TGEV neu- 
tralization, since neutralizing MAbs representing five different noncompeting 
sites were unable to distinguish between these TGEV strains (Wesley, 1990). All 
these observations on the S protein of TGEV indicate that the peplomer protein 
is highly conserved among TGEVs. 
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The genetic relationship among enteric and respiratory isolates of the 
TGEV cluster has been determined, based on the RNA sequences of the S 
protein (Sánchez et al., 1992). An evolutionary tree relating these isolates indi- 
cated that a main virus lineage evolved from a recent progenitor virus that was 
circulating around 1941 (Fig. 5). From this, secondary lineages originated, 
PUR46, TOY56, MIL65, BRI70, and the PRCVs, in this order. Least-squares 
estimation of the origin of TGEV-related coronaviruses showed a significant 
constancy in the fixation mutation rate with time, that is, the existence of a 
well-defined molecular clock. A mutation fixation rate of 7 € 2 x 1074 nucleo- 
tide substitutions per site and per year was calculated for TGEV-related viruses. 
This rate falls into the range reported for other RNA viruses. Sequencing data of 
the S gene of PRCV virus indicate that there is a high similarity, as compared 
with TGEV, in the areas encoding antigenic sites A and D. A deletion of 672 
nucleotides in the 5' region, which in TGEV codes for sites C and B, leaves 
PRCV without these sites (Callebaut et al., 1988; Rasschaert et al 1990; Sán- 
chez et al., 1990, 1992; Wesley et al 1990; Britton et al., 1991). PRCV was 
detected about 40 years later than TGEV in Europe, where it spread very fast. 
All the European PRCVs have an identical deletion of 224 amino acids in the 
same position within the amino terminal half of the spike protein, suggesting 
that they were all derived from the same precursor (Sánchez et al., 1992). In 
contrast, American PRCVs, which were detected for the first time in 1989 
(Wesley et al., 1990), have deletions of different sizes (207 to 227 amino acids} 
located in slightly different positions (Fig. 4) (Vaughn et al 1994; Halbur et al., 
1992, 1993; P. Paul, personal communication|, suggesting that they originated 
independently. 
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FIGURE 5. Evolutionary tree of TGEV related coronaviruses. Neighbor-joining and least-squares 
methods of tree reconstruction procedures were applied to the first 1956 nt of 13 virus isolates 
(Sánchez et al., 1992). Numbers in the diagram indicate residue substitutions between branching 
points. A, Indicates the introduction of a deletion between branching points; *, indicates that all the 
descendents of this fork have, with a probability of 99.996, a recent common ancestor. 
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A comparison of the primary structure of the peplomer proteins of TGEV 
and FIPV and of the 3' end of their genome indicates that they are closely 
related, both in their structural and nonstructural proteins (de Groot et al., 
1987; Jacobs et al 1987). Sequence analysis of the S protein genes revealed one 
domain, from amino acids 1 to 274, in which the nucleotide sequences were 
3996 similar, whereas in the second domain, from residues 275 to 1447, the 
identity was 93%. Comparison of the 3’ ends of the FIPV and TGEV genomes 
revealed that the highest sequence similarity (98.5%) is in the 3'-noncoding 
sequences. Large insertions and deletions in the FIPV and TGEV genomes were 
detected that could be the result of RNA recombination events (de Groot et al., 
1988; Vennema et al., 1992). The amino acid sequence of the CCV S protein has 
91.196, 81.696, and 2496 identity with FIPV, TGEV, and MHV, indicating a closer 
relationship with FIPV. This conclusion is reversed, however, if the M and N 
proteins are compared (Horsburgh et al 1992). The CCV S protein has 1452 
amino acids, slightly larger that the 1447 or 1449 of TGEVs. 

The identity between the M proteins of two TGEV strains (ENG70 and 
PURAO] is 98% at the nucleotide and amino acid level (Britton et al., 19883). 
There are 11 or 12 amino acid substitutions between the British and the PUR46 
strain, as determined by Kapke et al. (1987) or by Laude et al. (1987), respec- 
tively. 

In the TGEV N protein, three antigenic domains (A, B, and C) have been 
defined using MAbs (Martín-Alonso et al., 1992). By studying the binding of 
these MAbs to virus isolates, it was shown that the three domains were highly 
conserved in TGEV isolates, but that B domain differs between TGEVs and 
PRCVs (Sánchez et al., 1990). The amino acid sequences of the N protein genes 
from the British (ENG70| and the PUR46 strains of TGEV are 97.9% identical, 
and most of the changes are relatively conservative (Kapke and Brian, 1986, 
Britton et al., 1988b). The data available on the three structural proteins of 
TGEV isolates indicate that there is also high conservation among isolates 
collected from remote areas and analyzed after different passage numbers in cell 
culture. 

The sM protein of TGEV has been recently described [Godet et al 1992). A 
comparison of the nucleotide sequences of sM genes from virulent (BRI7O, 
MIL65] or attenuated (PUR46, PTV-ATT] TGEVs and PRCVs (ENGII, FRA86} 
showed a high degree of conservation (Fig. 6) (C. M. Sánchez, M. L. Ballesteros, 
and L. Enjuanes, unpublished results). 


C. The Nonstructural Proteins 


The genes coding for three nonstructural proteins of TGEV (nsp 3, 3-1, and 
7) of seven strains of TGEV, including two European PRCVs (ENGII and FRA86) 
and one American isolate (IND89), have been aligned (Fig. 6). Gene 3 is one of 
the areas of major genetic variability among TGEVs. There are major differ- 
ences between the noncoding areas of these genes (Fig. 6) (Wesley et al., 1989, 
1990, 1991a,b; Britton et al., 1989, 1991; Rasschaert et al., 1990; Laude et al., 
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1993; C. M. Sánchez and L. Enjuanes, unpublished results). Between the 3' end 
of the S gene and the start of gene 3, there are deletions in six different positions, 
in the seven viral isolates. In contrast, in ORFs 3-1 and 4 (translated and 
nontranslated areas), deletions were only observed in two and one position, 
respectively. The presence of a deletion (ENGII, FRA86} or a mutation (IND89] 
has altered the ORF 3 consensus sequence CUAAAC in PRCY isolates, prevent- 
ing its expression. In contrast, ORF 3-1, which is expressed in PRCVs, is not 
functional in TGEVs isolates: PUR46, PTV-ATT, and BRI70 due to alterations 
in the consensus sequence. Finally, TGEV MIL65 strain expresses both 3 and 3-1 
ORFs. In ORFs 3 and 3-1 amino acid identity was higher than 95%. The lack of 
expression of ORF3 in PRCVs and ORF3-1 in some TGEVs indicates that they 
are not essential for in vitro or in vivo replication. Although, alterations in 
these ORFs might be involved in the pathogenicity of these isolates. 

CCV and FIPV have ORFs equivalent to the TGEV ORFs 3 and 3-1. In 
addition, CCV has an extra ORF (named 3x) that, to date, has not been detected 
in this group of viruses and could potentially encode a 71 amino acid protein 
overlapping with ORFs 3a and 3b of CCV. Due to the context of the ORF, it is 
very unlikely that it is expressed and probably represents an evolutionary 
redundant sequence that is no longer required by the virus (Horsburgh et al., 
1992). Amino acid identity in ORFs 3 and 3-1, between TGEV and CCY, is high: 
83.596 and 92.796, respectively. 

The genomic organization at the 3' end of TGEV differs from that of CCV 
and FIPV in that the latter viruses contain additional ORFs (Fig. 1) (de Groot et 
al., 1988; Vennema et al., 1992; Horsburgh et al., 1992). At the 3’ end, TGEV has 
ORF 7, CCV has ORFs 7a and 7b, while FIPV has ORFs 6a and 6b. In addition, 
TGEV ORE 7 has a 69 nt deletion. These nucleotides are present in all FIPV 6a 
ORFs. The presence rather than absence of ORF 6b appears to be the common 
theme in FIPV and CCV, strongly suggesting that these nucleotides have been 
lost in TGEV (Vennema et al., 1992). Paired comparison of the colinear parts of 
the amino acid sequences of the TGEV and CCV ORF 7, with FIPV ORF 6, have 
shown an identity of 96% and 77%, respectively (Vennema et al., 1992). 

The antigenic and genetic homology among the isolates of the TGEV 
cluster strongly suggests that they have diverged from a common ancestor. 
Differences in the antigenic and genomic maps of coronaviruses, including 
TGEV and FIPV, show that in their divergence certain transcription units have 
been lost, gained, or translocated (de Groot et al., 1987, 1988; Sánchez et al., 
1990, 1992). There are three regions where deletions can occur at a higher 
frequency: within the S gene, between the S and M genes, and downstream of 
the N gene (Wesley et al., 1990, Rasschaert et al., 1990; Britton et al., 1991; 
Sánchez et al., 1992; Horsburgh et al., 1992; Vennema et al., 1992). The rele- 
vance of RNA recombination to the evolution of these viruses has been sug- 
gested by the occurrence of recombination during TGEV infections (Ballesteros 
et al., 1994). The porcine, feline, and canine coronaviruses can infect the same 
cell type in the gut of their natural host species (Reynolds et al., 1980; Stoddart 
and Scott, 1989; Woods et al., 1981), which would facilitate recombination 
between them. More specifically, CCV (Woods and Wesley, 1986), FIPV (Woods 
et al., 1981), and obviously TGEV can infect swine. 
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IV. CLASSIFICATION OF TGEV AND RELATED 
CORONAVIRUSES 


Based on the antigenic cross-reaction among coronaviruses, four antigenic 
groups have been defined (Siddell et al., 1982; Sturman and Holmes, 1983; Wege 
et al., 1982; see also Chapter 1, this volume], two of which include mammalian 
viruses and the other two, avian viruses. In this grouping several isolates, not 
all of which are yet recognized as coronavirus species, remain unclassified: 
namely, hemagglutinating encephalomyelitis virus (HEV) (Mengeling et al., 
1972). human enteric necrotic coronavirus (HENCV] (Resta et al., 1985); rabbit 
coronavirus (RbCV) (Small and Woods, 1987); mink coronavirus [which has 
been tentatively related to TGEV and PEDV (Have et al., 1992), although these 
results have not been confirmed], and cheetah coronavirus (ChCV}, which is 
antigenically related to the feline coronaviruses by immunofluorescence (Ever- 
mann et al., 1989). The RbCV could be related to TGEV based on in vivo protec- 
tion experiments (Small and Woods, 1987), but there is evidence that this virus 
may have a gp65 glycoprotein and a proteolytically cleaved peplomer (Desco- 
teaux et al., 1985), both of which are not characteristic for the TGEV group. 

TGEV and related viruses can be classified on the basis of their protein 
composition and antigenic properties. The protein composition of TGEV and 
related coronaviruses (Table II) shows that these viruses have three families of 
abundant proteins with molecular weight 160-220 kDa (S protein}, 47-56 kDa 
[N protein], and 22-36 kDa (M protein) (Laude et al., 1986; Jiménez et al., 
1986a,b; Cavanagh et al., 1990, 1994). In addition, TGEV has another structural 
protein, sM [Godet et al 1992). Some coronaviruses have yet another structural 
glycoprotein (HE, gp65}, which forms dimers through disulfide bridges. These 
dimers form a second “crown” below the one formed by the S protein and have 
hemagglutinating activity. TGEV and the related coronaviruses PRCV, FIPV, 
FECV, and CCV (Sanchez et al. 1990) do not have gp65. Other coronaviruses, 
including rat coronavirus (RCV), human coronavirus (HCV) OC43, HENCV, 
bovine coronavirus (BCV), and diarrhea virus of infant mice (DVIM) have gp65 
as an abundant component. MHV and HEV contain variable amounts of the 
hemagglutinin esterase protein, depending on the particular isolate or the host 
cell. 

Another characteristic useful for classification of coronaviruses is the 
cleavage of the spike protein into two halves. The peplomer protein of TGEV 
and related isolates (PRCV, FIPV, FECV, CCV, and HCV 229E) is not cleaved, in 
contrast to the antigenically unrelated coronaviruses: turkey coronavirus 
(TCV), HCV OC43, BCV, MHV, IBV, and, tentatively, HEV. These later viruses 
have a protease cleavage site on the peplomer protein that is absent on the 
uncleaved S proteins (Table II and references therein). 

The binding of 42 MAbs, which recognized at least 25 epitopes, to enteric 
isolates of TGEV, collected during a period of 41 years in three distinct geo- 
graphical areas (America, Europe, and Asia), PRCV isolates, and other coro- 
naviruses, showed a strong antigenic homology among the corresponding viral 
proteins (S, M, and N) of TGEV, PRCV, FIPV, FECV, and CCV (Fig. 3). In contrast, 
no cross-reactivity with PEDV, HCV 229E, HCV OC43, HECV, BCV, or MHV 
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was detected (Sánchez et al., 1990). Interestingly, all TGEV, PRCV, FIPV, FECV, 
and CCV shared the antigenic subsite Ac, defined by three MAbs that neutral- 
ized all these isolates (Sánchez et al., 1990). The presence of the antigenic 
subsite Ac in a coronavirus could be taken as the basis to define an antigenic 
cluster, which groups TGEV, PRCV, FIPV, FECV, and CCV with all members 
having: (1) common epitopes in the three structural proteins; (2) no conven- 
tional cleavage site in the peplomer protein; and (3) no gp65 glycoprotein. 

HCV 229E was classified into the TGEV serological group (Pedersen et al., 
1978) based on the binding of polyvalent antibodies, as determined by immuno- 
fluorescence. In contrast, no reactivity was detected by radioimmunoassay 
(RIA) or ELISA (Sánchez et al 1990), with a panel of 42 MAbs (Correa et al., 
1988) or with a second collection of independently derived MAbs (Laude et al., 
1986; P. Talbot, personal communication). Also, the binding of TGEV- or HCV 
229E-specific polyvalent antisera adsorbed with noninfected cells to these vi- 
ruses in a RIA was absent or slight (Sánchez et al., 1990). Reynolds et al. (1980) 
did not detect neutralizing activity for CCV or TGEV in the antisera specific to 
the 229E strain of human coronavirus. This suggests that the cross-reactivity 
observed by Pedersen et al. (1978) could be due to nonstructural antigenic 
determinants (Garwes, 1982). No cross-hybridization between the HCV 229E 
RNA and a cDNA probe complementary to the N and M genes of TGEV has 
been reported, while a strong hybridization was observed with TGEV, FIPV, and 
CCV (Shockley et al., 1987). 

Sequence data, in contrast to antigenicity and cross-hybridization data, 
have shown significant sequence homology in three genes coding for the main 
structural proteins of HCV 229E, PEDV, and other TGEV-related viruses. On 
the basis of S gene sequences, there is a closer relationship of PEDV with the 
TGEV-related subset than with the MHV-related subset and IBV (Duarte and 
Laude, 1994), confirming the reported sequence homology between these vi- 
ruses in their N and M genes (Raabe and Siddell, 1989; Schreiber et al., 1989; 
Bridgen et al., 1993; Duarte et al., 1994). A multiple alignment of the S protein 
sequences of PEDV, TGEV, FIPV, CCV, and HCV 229E has been constructed 
(Duarte and Laude, 1994}, and a significant sequence relationship between 
PEDV and HCV 229E was found, with level of identity of 6096 (S2 region) and 
37% (S1 region). The percentage identity of the PEDV N protein with these 
coronaviruses ranged from 12 to 19% with MV IBV, HCV OC43, and BCV and 
from 32 to 37% with FIPV, CCV, PRCV, TGEV, and HCV 229E (Bridgen et al., 
1993). A comparison of the amino sequences of the M proteins showed that the 
HCV 229E M protein has a higher sequence similarity to the homologous 
protein of TGEV (HCV/TGEV, 68%; HCV/MHV, 5896; HCV/BCV, 5796; and 
HCV/IBV, 52%) (Raabe and Siddell, 1989). 

Some cross-reactivity, as detected by immunoblotting, has been reported 
among the N proteins of various coronaviruses previously considered anti- 
genically unrelated: FIPV, PEDV, and HEV (Yaling et al., 1988); TGEV, HCV 
229E, MHV 3, and HEV (Yassen and Johnson-Lussenburg, 1978), and MHV-A59 
and IBV-M41 (H. G. M. Niesters, unpublished observations). More work is 
needed to determine whether all coronaviruses are antigenically related at the 
N-protein level. 

In spite of the sequence homology between PEDV and viruses related to 
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TGEV, no antigenic cross-reaction has been detected using both polyvalent or 
MAbs with this group of viruses (Pensaert and De Bouck, 1978; Garwes and 
Reynolds, 1981; Callebaut et al., 1988; Enjuanes et al., 1990; C. M. Sánchez and 
L. Enjuanes, unpublished results). Also, no antigenic relationship was detected 
between TGEV and HEV. These results are interesting, in the case of PEDV, as it 
causes a disease similar to the one produced by TGEV. In summary, using 
sequencing data there is a cluster of viruses formed by TGEV, PRCV, CCV, FIPV, 
FECV, PEDV, and HCV 229E. Using serological criteria, an antigenic cluster 
including TGEV, PRCV, CCV, FIPV, and FECV is differentiated. 


V. DIAGNOSIS OF TGEV, PRCV, AND RELATED 
CORONAVIRUSES 


A rapid diagnosis of TGE is important to discriminate it from enteritis 
caused by enteropathogenic E. coli, in order to determine if treatment with 
antibiotics is required. Several techniques have been developed for TGEV diag- 
nosis: immunofluorescence, reversed passive hemagglutination, ELISA, RIA, 
and hybridization with DNA probes. TGEV antigens can be specifically de- 
tected by immunofluorescence. Coronaviruses of the TGEV cluster can be 
distinguished with type-, group- and interspecies-specific MAbs (Sánchez et al., 
1990). Reversed passive hemagglutination, based on the agglutination of erythro- 
cytes coated with TGEV-specific antibodies, is more sensitive than immuno- 
fluorescent staining of primary cultures of porcine kidney cells inoculated with 
the specimens (Asagi et al 1986). It is also simple and rapid. 

PRCV induces a serological response that originally could not be distin- 
guished from that of TGEV-infected swine. This was a considerable drawback, 
since the movement of pigs between countries is frequently restricted until 
evidence that the stock is specifically free from TGEV infection has been 
obtained. This requires a diagnostic procedure capable of differentiating the two 
viruses. For this purpose, competitive RIA (Sánchez et al., 1990) and ELISA 
(Callebaut et al.. 1989) with type- and group-specific MAbs have been devel- 
oped. These assays are sensitive tests for the detection of epitope-specific 
antibodies, and provide no false-positive results. As a small percentage of false- 
negative results may occur with this test, a negative result must be confirmed, 
using several serum samples from the same farm. A cDNA clone containing 
396 base pairs (bp) from the 5’ end of the TGEV S gene of MIL65 virus has been 
used to differentiate TGEV from PRCV. The probe also hybridizes to CCV but 
not to FIPV (Wesley et al., 1991b). 

To differentiate coronaviruses from other agents that cause similar disease 
and to determine the mechanism by which coronaviruses perpetuate enzootic 
or epizootic outbreaks, cloned cDNA probes, representing 2 kb from the TGEV 
genome, have been used in dot blot hybridization assays to detect viral RNA 
from cell culture and from fecal specimens. The cloned sequence encompasses 
the 3'-noncoding region, the nucleocapsid protein gene, and a large portion of 
the membrane protein gene. 32P-labeled cDNA probes prepared from these 
clones detected as little as 23 pg of homologous RNA, but did not detect RNA 
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from the nonrelated virus, even when amounts of up to 10 ng per dot were used. 
In cell culture fluids, these probes detected TGEV, FIPV, and CCV, but not HCV 
229E, HCV OC43, BCV, HEV, and MHV A59 (Shockley et al., 1987). 

To differentiate PEDV from TGEV and other porcine coronaviruses, an 
ELISA blocking test, based on crude virus preparations (Callebaut et al., 1982), 
or an ELISA test that uses purified PEDV adapted to grow on Vero cells 
(Hofmann and Wyler, 1990) have been developed. 


VI. IMMUNE PROTECTION 


The main economic losses caused by TGEV result from high mortality 
rates in newborn piglets, under the age of 10 days. These piglets do not have a 
fully mature immune system, and the time is too short to elicit a protective 
immune response. Passive immunity from colostrum and postcolostral milk is 
crucial in providing protection to neonates against TGEV infection (Abou- 
Youssef and Ristic, 1975; Bohl and Saif, 1975). Key experiments (Stone et al., 
1977; Wesley et al., 1988; De Diego et al., 1992) showed that both the IgG and 
sIgA immunoglobulin fractions of colostrum and postcolostral milk from im- 
mune sows confer protection when fed to susceptible piglets. Thus, newborn 
animals can be protected by natural lactogenic immunity provided by sows or 
by artificial lactogenic immunity using serum or protective MAbs. 

To induce mammary antibodies, the sows have to be immunized 2 weeks 
before delivery, preferentially in the intestinal tract (Saif and Wesley, 1992). 
Stimulation of the mucosal humoral immune system against TGEV also can be 
induced by priming of the bronchus-associated lymphoid tissue (Cox et al., 
1993; De Diego et al. 1992; Wesley and Woods, 1993). Immunization in the 
mammary glands gave variable results. Some authors claimed protection (Bohl 
et al., 1975; Saif and Bohl, 1983; Woods, 1984], while others (Aynaud et al., 1986) 
did not find the intramammary route a valid alternative to the oral route. The 
available data agree on the low efficacy of the intramuscular route (Saif and 
Wesley, 1992; Moxley and Olson, 1989; Moxley et al., 1989). 

Three types of viruses have been used to develop lactogenic immunity: 
(1) virulent TGEV; (2) attenuated TGEV; and (3) TGEV-related coronaviruses 
(PRCV, FIPV, and CCV). The oral administration of nonattenuated TGEV in 
sows has generally resulted in protective levels of immunity for the sow and 
passive (lactogenic] immunity for suckling pigs (Haelterman, 1965; Bohl et al., 
1972; Saif and Wesley, 1992; Moxley and Olson, 1989; De Diego et al., 1992). 
Because vaccination with nonattenuated TGEV has the obvious risk of spread- 
ing pathogenic virus (Bohl, 1982), vaccine research has been focused on attenu- 
ated and variant strains of TGEV (Chen and Kahn, 1985; Aynaud et al., 1985, 
1988; Fitzgerald et al., 1986; Nguyen et al., 1987; De Diego et al., 1992; Cox et 
al, 1993, Wesley and Woods, 1993) or on the use of related coronaviruses 
(Woods, 1984; Hooyberghs et al., 1988). Attenuated TGEV vaccines have only 
limited efficacy. The problems were inconsistent results in their experimental 
evaluation, occurrence of epizooties, and persistence of TGEV in vaccinated 
herds (Saif and Wesley, 1992; Moxley and Olson, 1989}. 
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Comparative studies of virulent and attenuated strains of TGEV revealed 
that the former are more stable to pancreatic enzymes (Chen, 1985; Furuuchi et 
al., 1975, 1976}, acidity (Chen, 1985; Hess and Bachmann, 1976; Laude, 1981], 
and porcine intestinal fluids (Chen, 1985). These properties may protect viru- 
lent TGEV from inactivation during its passage through the upper part of the 
gastrointestinal tract and permit viral replication in the small intestine, where 
it stimulates enteric immunity. A variant of TGEV (derived from the attenuated 
Purdue strain}, resistant to trypsin and a-chymotrypsin, induced lactogenic 
immunity (Chen and Kahn, 1985). Using a survivor selection process in gastric 
juice, a mutant of TGEV, selected from a low passage strain (D-52}, acquired 
simultaneous resistance to acidity and pepsin or trypsin cleavage (Aynaud et al., 
1985; Nguyen et al., 1987). This mutant induced protective immunity when 
administered by the oral route but not by the intramuscular or intramammary 
routes (Aynaud et al., 1988). Molecular characterization of attenuated vaccine 
strains of TGEV has shown alterations in mRNAs 2 and 3 of these viruses, 
affecting the spike and the nonstructural protein 3 (Register and Wesley, 1994). 

Protection by the TGEV-related coronavirus PRCV has been studied by 
several groups. Some of them have found no protection (Hooyberghs et al., 1988; 
Van Nieuwstadt et al., 1989; Paton and Brown, 1990], while others have de- 
scribed a significant level of protection (Bernard et al., 1989; Cox et al., 1993; De 
Diego et al., 1992; Wesley and Woods, 1993). The poor protection observed by 
the first group might be due to the lack of or insufficient antigenic stimulation 
of the gut-associated lymphoid tissue during PRCV infection of the sows. PRCV 
is known to have a respiratory tropism. An intermediate degree of protection 
induced by PRCV has been described. Infection of pigs with this virus primes 
the systemic and mucosal humoral immune system against TGEV, so that 
subsequent challenge with TGEV results in a secondary antibody response and 
in a decreased duration of infectious TGEV excretion. Experimental vaccina- 
tion of seronegative naive gilts with PRCV induced lactogenic immunity 
against TGEV. The overall survival rate ranged from 47 to 70%, but was variable 
from sow to sow (Bernard et al., 1989; De Diego et al., 1992; Wesley and Woods, 
1993). 

These results suggest that there is a link between respiratory infection with 
PRCV and secreted protective antibody in the mammary glands of postpar- 
turient gilts. The level of virus-neutralizing antibody in serum and colostrum 
that was induced by PRCV vaccination did not correlate with piglet survival 
(Wesley and Woods, 1993; Bernard et al., 1989). This has also been recognized as 
a consistent feature of TGEV vaccination and challenge experiments (Saif and 
Wesley, 1992), which suggests that the immunodominant neutralizing epitopes 
for PRCV and TGEV are probably not the major contributors to passive protec- 
tion. Pigs previously infected with PRCV develop a rapid secondary immune 
response upon infection with field strains of TGEV. The duration of the TGE 
outbreak is shortened and the loss of piglets is substantially reduced. This 
situation may open new ways for vaccination against TGEV. These results are 
consistent with the apparent correlation between dissemination of PRCV and 
the reduction in incidence of TGEV observed by Jestin et al. (1987a,b). 

Vaccination against TGE with heterologous coronaviruses such FIPV 
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(Woods and Pedersen, 1979; Woods, 1984) and CCV (Woods and Wesley, 1986) 
only provided partial protection against TGEV. The reverse experiments, that 
is, Vaccination to protect against FIPV with heterologous live virus vaccines 
(TGEV, CCV, and HCV 229E|, also did not provide satisfactory results (Barlough 
et al., 1984, 1985; Toma et al., 1979; Woods and Pedersen, 1979; Scott, 1987). 


VII. NEW TRENDS IN VACCINE DEVELOPMENT 


An effective vaccine against TGE should protect newborn piglets through 
lactogenic immunity. At least two approaches could be undertaken: (1) the 
production of noninfectious antigens that are targeted to the gut, or (2) the use 
of live vectors with enteric tropism. Both approaches require the definition of 
the B- and T-cell epitopes involved in protection and the search for molecules 
promoting IgA responses. In addition, the first approach requires the incorpora- 
tion of molecules with affinity for enteric gut cells. B-cell epitopes involved in 
protection are, most frequently, those involved in the induction of neutralizing 
antibodies, although viral proteins that do not have this activity have been 
shown to induce protection against herpes simplex virus (Chan et al., 1985], 
cytomegalovirus (Reddehase et al., 1987), and other viral systems (Whitton et 
al., 1989; Klavinskis et al., 1989). At least five antigenic sites are involved in the 
induction of TGEV-neutralizing antibodies: sites A, B, and D, and the sites on 
the S protein defined by MAbs SCT and 5D5 from R. D. Woods’ and R. Wesley's 
laboratory (personal communication). Of these five domains, site A is the major 
inducer of neutralizing antibodies. Sites A and B are complex, conformational, 
and glycosylation-dependent. Site D can be represented by synthetic peptides, 
although glycosylation has a minor effect on its conformation. Site C is contin- 
uous and glycosylation-independent. A peptide from site D, which includes 
residues 379 to 386 (SFFSYGEI) from the S protein of TGEV, induced neutraliz- 
ing antibodies. This peptide induced antibodies with a higher titer in neutraliz- 
ation when coupled to a second S protein derived peptide, which includes 
residues 1160 to 1180 (Posthumus et al., 1990a,b). The combination of these 
peptides could be the first candidate for a subunit vaccine. Unfortunately, the 
neutralizing epitopes selected in these studies were defined using swine testicle 
cells and intestinal porcine epithelial cells in culture (C. M. Sánchez and L. 
Enjuanes, unpublished results), and may differ from the epitopes that induce 
protection in vivo. This is a real possibility, since no correlation has been found 
between neutralization titer and protection by some authors (Bernard et al., 
1989; Saif and Wesley, 1992; Wesley and Woods, 1993]. 

The T-cell epitopes potentially involved in protection against TGEV are 
being defined by using virus-specific T-cell hybridomas (Bullido et a1.1989) and 
polyclonal T cells. T-cell epitopes have been identified on the three major 
structural proteins of TGEV: S, M, and N. A dominant T-helper epitope defined 
in the N protein helps the synthesis of TGEV-neutralizing antibodies specific 
for the S protein in vitro (Antón et al., 1995). 

IgA isotype immunoglobulin is more stable in the gut than those of the IgG 
isotype (Porter and Allen, 1972). Several types of T-cell factors have been impli- 
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cated in the promotion of IgA responses: interleukin 5 (Harriman et al., 1988; 
Lebman and Coffman, 1988; Strober and Harriman, 1989|, transforming growth 
factor B (Coffman et al 1989; Sonoda et al 1989), and a factor that binds to the 
Fc portion of the IgA immunoglobulin on the surface of B cells (Mestecky and 
McGhee, 1987). Genes coding for these factors should be cloned and expressed 
in combination with viral antigens in order to study their role in the induction 
of secretory immune responses against TGEV. 

Two types of molecules have shown to be effective as carriers of unrelated 
antigens in the stimulation of gut-associated lymphoid tissue after oral admin- 
istration: cholera toxin B subunit and the pili of the enterotoxic K88 strain of E 
coli. Most nonviable antigens are often inefficient in promoting antibody re- 
sponses. They require large (milligram) quantities of immunogen and yield, if 
any, modest antibody responses (Czerkinsky et al., 1987, 1989; Mestecky, 1987). 
A notable exception is cholera toxin (CT), which is a potent immunogen 
(Pierce, 1978). CT and E. coli labile toxin (LT) have been shown to exert potent 
adjuvant effects in mice on gut-immune responses to unrelated antigens pre- 
sented orally (Elson and Ealding, 1984; Lycke and Holmgren, 1986; Nedrud et 
al., 1987). There has been a debate on the requirements of the holotoxin (A and B 
subunits of CT or LT). The best adjuvant effect is induced by the complete 
toxin. This adjuvant effect appears to be closely linked to the ADP-ribosylating 
action of CT and LT, associated with enhanced cyclic AMP formation in the 
affected cells, and thus it may prove difficult to eliminate the enterotoxic 
activity without loss of adjuvant activity. However, as an antigen carrier system 
providing specific binding to epithelium, including the M cells of the intestinal 
Peyer's patches, both CT and its nontoxic binding subunit moiety have been 
shown to markedly enhance the mucosal immune response to various foreign 
antigens or epitopes covalently linked to these molecules (Holmgren et al., 
1993). An antigenic determinant representing multimers of site D from the 
peplomer protein of TGEV has been expressed as a fusion protein on the 
carboxy-terminus of the LT-B subunit. The recombinant product induced 
TGEV-neutralizing antibodies (Smerdou et al., 1994). The LT-B-site D antigen 
was expressed using attenuated forms of Salmonella typhimurium, which have 
a tropism for the Peyer's patches (Curtiss et al., 1986). Double mutants of this 
recombinant Salmonella, defective in the synthesis of cAMP and the cAMP 
receptor, persisted in the gut lymphoid organs for about 3 weeks and induced 
TGEV antibodies (Smerdou et al. 1994). Its role in protection is being evaluated. 
Other attempts to develop genetically engineered vaccines using prokaryotic 
vectors have failed. Most of the S protein gene has been expressed at high levels 
in E. coli. Subcutaneous immunizations with the recombinant antigen did not 
induce neutralizing antibodies (Hu et al., 1984, 1987) nor induce protection in 
vivo. 

A 23-kDa subunit immunogen obtained from purified TGEV by sonifica- 
tion, isopycnic centrifugation, and gel filtration through Sephadex G200 was 
administered intramuscularly to gilts prior to farrowing. It protected piglets 
suckling on the vaccinated gilts (Gough et al., 1983a,b). These results were 
unexpected, as intramuscular inoculation of inactivated TGEV did not provide 
protection (Saif and Wesley, 1992) and have not been confirmed. 
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Expression in eukaryotic vectors is required for those TGEV antigenic 
determinants that are dependent on glycosylation, such as sites A and B, of the S 
glycoprotein. Two types of live eukaryotic vectors have been considered to 
induce protection against TGEV: poxvirus and adenovirus. Porcine poxviruses 
and adenoviruses induced systemic and secretory IgA responses in pigs (Tuboly 
et al., 1993). Attempts to immunize against TGEV with TGEV-vaccinia recom- 
binants expressing most of the peplomer protein have resulted in the induction 
of TGEV-neutralizing antibodies (Hu et al., 1987) but not in protection. Adeno- 
viruses have the advantage of their tropism for gut- and bronchus-associated 
lymphoid tissue. Also, these viruses have a DNA genome that facilitates their 
use as vector (Graham and Prevec, 1992; Prevec et al., 1989; Schneider et al., 
1989; Lubeck et al., 1989; Bett et al., 1993). Using human adenoviruses-based 
vectors, the whole spike protein and truncated fragments of this glycoprotein 
have been expressed. The Ad5- TGEV recombinant induced TGEV-neutralizing 
antibodies when administered to hamsters [Sánchez et al 1994). Ad5- TGEV 
recombinants infected swine. The tropism of these recombinants is being deter- 
mined using a recombinant Ad5-luc vector (Mittal et al., 1993), expressing 
luciferase activity. Both respiratory and enteric tissues were infected and the 
enzymatic activity expressed (J. M. Torres, C. M. Sánchez, F. Graham, and L. 
Enjuanes, unpublished results). Porcine adenoviruses are being developed as 
vectors to induce protection against TGEV. The genome of porcine adenovirus 
type 3 (Seshidar-Reddy et al., 1993) and type 4 (Kleibocker et al., 1993) have been 
cloned and partially sequenced and deletions on the E3 gene introduced to 
facilitate the cloning of heterologous genes, including the S gene of TGEV. 

Expression of TGEV proteins under the control of the baculovirus poly- 
hedrin promoter or in filamentous fungi (as Aspergillus nidulans or the Mucor 
system) may be an economic way of producing large amounts of glycosylated 
antigens (Van Brunt, 1986). The S, M, and N proteins of TGEV have been 
expressed using baculovirus (Britton et al., 1990a; Pulford et al., 1990; Tuboly et 
al., 1993]. The recombinant baculovirus developed by Tuboly et al. (1993) con- 
tained inserts ranging from 1.6 kb, encoding sites C, B, and D, to 3.3 kb, 
encoding all the major antigenic sites (C, B, D, and A). Piglets immunized with 
the recombinants developed a strong antibody response, but only recombinants 
expressing at least part of antigenic site A induced in vitro neutralizing anti- 
bodies (T. Tuboly and J. B. Derbyshire, personal communication]. 

There is an increasing interest in the use of RNA viruses that do not 
replicate through a DNA intermediate as vectors for the expression of hetero- 
logous genes. This is the case of alphaviruses (Schlesinger, 1993; Bredenbeeck et 
al., 1993; Liljestróm and Garoff, 1991). Coronaviruses have a genome of 30 kb, 
which is too large to be easily manipulated. This has prompted the isolation and 
characterization of defective subgenomic RNAs, which are easily generated in 
the murine coronavirus MHV. Identification of the minimum requirements to 
replicate these subgenomic RNAs (Makino et al., 1988; Makino and Joo, 1993), 
the packaging signal (Makino et al., 1990; van der Most and Bredenbeek, 1991; 
Fosmire et al., 1992), and the possibility of engineering cDNAs coding for these 
RNAs (Koetzner et al., 1992; van der Most et al., 1992; Masters, 1992; Masters 
et al., 1994) are helping the development of vectors based in coronaviruses. 
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TGEV subgenomic RNAs have been identified. A cDNA has been derived from 
a 10-kb defective RNA of TGEV. This cDNA has been cloned and sequenced 
(Mendez et al., 1994). The subgenomic RNA is packaged into capsids and it can 
be used to develop a TGEV-based vector to induce secretory immunity in swine 
(C. Smerdou, A. Mendez, M. L. Ballesteros, C. M. Sánchez, and L. Enjuanes, 
unpublished results). 

Anti-idiotypic antibodies are in theory an interesting source of antigen, 
especially those mimicking the complex antigenic site A, the major inducer of 
neutralizing antibodies in TGEV. The use of antibodies as antigen may have the 
advantage of their adsorption into the gut, particularly in the first days after 
birth. Induction of neutralizing antibodies to TGEV by porcine anti-idiotypic 
antibodies generated against a murine-neutralizing MAb has been shown in 
mice (Hariharan et al., 1989). These antibodies neutralized TGEV in vitro, but 
no protection studies in vivo have been reported. Monoclonal anti-idiotypic 
antibodies of the y and B (internal-image] type, which induced and Ab3 TGEV- 
neutralizing, are now being tested in vivo for protection (Suñé et al., 1991). 
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CHAPTER 17 


Pathogenesis of the Porcine 
Coronaviruses 


DAVID J. GARWES 


I. GENERAL INTRODUCTION 


Of all the animal species that the coronaviruses have evolved to grow in, the pig 
appears to have become host to more members of the family than any other. 
The filterable agent identified as the cause of transmissible gastroenteritis 
(TGE) in American swine in the 1960s was subsequently confirmed as a coro- 
navirus, TGEV, shortly after the family was formally identified in 1969. During 
the following decade, the causal agent of an encephalomyelitic disease that was 
associated with vomiting and wasting in swine in North America and Europe 
was shown to be a second coronavirus, antigenically unrelated to TGEV, named 
hemagglutinating encephalomyelitis virus (HEV). 

A third porcine coronavirus was identified in association with a TGE-like 
disease of pigs in England and Europe during the 1970s. This virus, named 
porcine epidemic diarrhea virus (PEDV), had a pathogenesis similar to that of 
TGEV but was quite unrelated to either TGEV or HEV. The fourth porcine 
coronavirus to be identified was isolated from mild respiratory infections of 
European pigs that had serum antibodies to TGEV but showed no evidence of 
gastroenteric infection. Recent work has confirmed that this virus is closely 
related antigenically and genetically to TGEV but has a greatly reduced ability 
to grow in the intestinal mucosa. It is clear that the virus, named porcine 
respiratory coronavirus (PRCV|, is a mutant strain of TGEV, but, as its patho- 
genesis is quite distinct from that of TGEV, it will be described separately. 
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II. TRANSMISSIBLE GASTROENTERITIS VIRUS 
A. History of the Disease 


Transmissible gastroenteritis of swine was probably first described in the 
1930s (Smith, 1956), although the demonstration of a viral etiology was not 
confirmed until the report of Doyle and Hutchings in the United States was 
published in 1946. They clearly showed that a filterable agent could be trans- 
mitted to piglets with the subsequent induction of a frequently fatal gastro- 
enteritis. The growth of the science of virology following this allowed further 
demonstrations of the viral cause of TGE, its isolation in cell culture, and the 
subsequent identification of TGEV in Canada, Europe, Taiwan, and Japan (re- 
viewed by Woode, 1969). Following the formal acceptance of the family Corona- 
viridae (Tyrrell et al., 1968), the causal agent of TGE was recognized as a corona- 
virus by Tajima (1970) 


B. Clinical Signs 


The severity of disease caused by TGEV is related to the age of the animal. 
Although pigs of all ages can become infected and sows can suffer from TGE to 
such an extent that agalactia occurs, it is the young piglet that usually shows 
the most severe clinical signs. Infection of the piglet during the first 2 weeks of 
life results in vomiting within 18 to 24 hr. This is usually followed by a profuse 
watery diarrhea that continues for several days and results in obvious dehydra- 
tion and frequently death 2 to 5 days after infection. In pigs over 2 weeks of age, 
the initial response to infection is the same as that of the neonatal piglet, but 
the mortality rate drops as the body weight of the animal at the time of 
infection increases. This probably reflects the ability of the animal to withstand 
dehydration during induction of an immune response that leads to subsequent 
recovery. 


C. Pathology 


The usual route of infection is oral, following ingestion of contaminated 
material. The virus is resistant to the low pH of the stomach and passes to the 
small intestine where it infects the columnar epithelial cells covering the distal 
portion of the villi in the jejunum and ileum. The virus rarely colonizes the 
epithelium of the duodenum, supporting a hypothesis that the virus is actively 
taken up by the microcanalicular-vesicular system found in villus epithelium 
of the jejunum and ileum in the young piglet bu? not in the duodenum after the 
second day of life (Wagner et al., 1973). The cells covering the tips of the villus 
are the first to be infected, and the virus then spreads down the villus while the 
infected cells at the tips are shed. The virus is never found in the cells at the base 
of the villi or in the crypt cells, and it may be that these cells are refractory to 
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infection. As the infected cells are shed from the villous tips, cells migrate up 
from the crypt to replace them and this terminates the infection, usually 5 to 7 
days after infection, as the new cells are not reinfected (Moon et al., 1976). 
Villous atrophy, detected as a measurable shortening of the villi, is associated 
with TGEV infection as with other forms of gastroenteritis (Hooper and Hael- 
terman, 1966). 

In a comparative study with attenuated and virulent strains of the virus, an 
apparent correlation was seen between virulence and tissue tropism. While the 
Purdue-115 and Gep II strains, which both induce clinical signs in week-old 
piglets, could be demonstrated in the enterocytes of the jejunum and ileum, 
Peyer's patches, and the mesenteric lymph nodes, the attenuated Nouzilly 
vaccine strain multiplied only in the ileum and mesenteric lymph nodes (Cu- 
bero et al., 1992). 

TGEV has not been detected in the cells of the large intestine, and the 
clinical signs appear to result from an increase in osmotic pressure in the lumen 
of the small intestine. The villous epithelium lost during infection carries the 
enymes responsible for the digestion of milk lactose, resulting in an inability to 
absorb the sugar (Hooper and Haelterman, 1966). Additional studies also sug- 
gested that the cells that migrate from the crypt to repopulate the denuded villi 
retain a secretory role, producing an abnormal level of sodium secretion into 
the lumen (Butler et al., 1974)) and loss of blood albumin (Prochazka et al., 
1975]. 

TGEV can be isolated from other organs of the infected pig (Cartwright, 
1967) including the kidneys and the lung. Several studies have shown that virus 
replication in the lung may be important for transmission and persistence of the 
disease (Underdahl et al., 1974; Kemeny et al., 1975). Lung infection, with 
TGEV antigen being detected in bronchiolar epithelial cells and alveolar cells, 
does not appear to be associated with respiratory disease, but may explain the 
emergence of the porcine respiratory coronavirus, discussed in Section III. 

A study of the thymus in TGEV-infected conventional and gnotobiotic 
piglets suggested that, over a 4-day period, infection accelerated thymus involu- 
tion, reduced thymocyte density, and resulted in hypertrophy of Hassall's 
bodies in the medulla (Kvachev and But, 1990]. 


D. Immune Response 


Since the target cell for TGEV is the apical epithelium of the intestinal 
villi, host immunity requires the secretion of antibodies into the lumen of the 
gut. For this reason, parenteral immunization with TGEV antigens, while 
providing good levels of circulating serum antibodies to the virus, provides no 
protection against the disease. During TGE infection the immune response 
results in the secretion of protective antibodies into the lumen of the small 
intestine and this plays a part in limiting the infection in the older pig (Bohl 
et al., 1972a). The rapid progression of the disease to death in newborn piglets 
infected with TGEV probably provides insufficient time for immunity to be 
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induced. It is well documented, however, that sows that have recovered from 
TGE can protect their offspring by passive transfer of immunity (Bay et al., 
1953). Following infection of the pregnant sow, B lymphocytes secreting IgA 
antibodies to TGEV circulate from the gut-associated lymphoid tissue and 
relocate in the mammary gland. For the first 3 days after piglets are born, the 
colostrum secreted by the mammary gland contains antibodies derived directly 
from the serum. Following this, however, the only immunoglobulin present to 
any extent in porcine milk is secretory IgA, which is synthesized in the mam- 
mary gland. In the convalescent sow the IgA secreting cells relocated from the 
gut-associated lymphoid tissue can provide protective antibodies to the piglets 
for several weeks (Bohl et al., 1972b). 

More recently, however, research findings have suggested that there may be 
other factors involved in the ability of milk from immune sows to protect 
piglets. The levels and classes of anti-TGEV antibodies were measured in milk 
from sows naturally infected with TGEV or immunized with the Nouzilly 
vaccine strain (Bernard et al., 1990). No correlation was demonstrated between 
the level of protection afforded and the titers of the various classes of antibodies 
to the virus in the colostrum and milk for the first 10 days after farrowing, 
although an inverse correlation was seen after 10 days. 

The role of cellular immunity in recovery from TGE has not been clearly 
demonstrated, although studies have shown lymphocyte responses in the 
Peyer’s patches and mesenteric lymph nodes of young piglets infected with 
virulent TGEV but not with attenuated virus (Welch et al., 1988). These authors 
suggested that this may in part explain why attenuated virus vaccines have 
proved to be less than fully successful for TGE. When used in the pregnant sow, 
vaccines have had limited success in generating lactogenic immunity, since 
most have failed to induce a secretory IgA response and relocation of active B 
lymphocytes in the mammary gland (Bohl et.al., 1975; Saif and Bohl, 1979). 


III. PORCINE RESPIRATORY CORONAVIRUS 
A. History of the Disease 


During routine serological surveillance of pig herds in Great Britain, Bel- 
gium, Holland, and France in the mid-1980s, an increase in the number of herds 
with antibodies to TGEV was noted but with no concomitant increase in 
clinical enteric disease. A coronavirus, PRCV, was isolated in 1986 from respira- 
tory tissue of affected pigs in Belgium (Pensaert et al., 1986), Great Britain 
(Brown and Cartwright, 1986), and subsequently from other parts of Europe and 
North America (Wesley et al., 1990). The virus was shown to be indistinguish- 
able from TGEV by conventional serological tests, although monoclonal anti- 
bodies to TGEV could be used as the basis for a differential diagnostic immuno- 
assay (Garwes et al., 1988). 

By the end of the 1980s, the virus had spread to infect almost 100% of 
Belgium pig farms (Pensaert and Cox, 1989). It is likely that the virus is spread 
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by the oronasal route and by aerosol. This is supported by data from a study 
with experimentally infected pigs in which airborne virus could be recovered 1 
to 6 days after infection and at levels that correlated with those in the nasal 
mucosa (Bourgueil et al., 1992). 

Several possible sources for the origin of this respiratory coronavirus have 
been discussed (Jestin et al., 1987). These include recombination between 
TGEV and another coronavirus, mutation of TGEV into a nonenteric strain, and 
adaptation of a TGEV-related coronavirus from another host species or its 
release as an attenuated TGEV vaccine. Comparative sequence analysis of the 
genomes from PRCV and TGEV have shown that there are high levels of 
homology (Britton et al., 1991; Wesley et al., 1991). The major difference be- 
tween the two viruses was determined to be a 600-700 base deletion in the gene 
encoding the surface peplomer, suggesting that the difference in tissue tropism 
for the two viruses might reside in the absence of a specific cell receptor in 
PRCV (see Chapter 16, this volume). 


B. Clinical Signs 


Most observers have noted little or no sign of respiratory disease associated 
with infection of pigs of all ages with this virus. Experimental studies to 
examine the disease have given somewhat contradictory results, although dif- 
ferences in the PRCV isolate and the age of the pigs used may explain the 
findings. 

O'Toole and co-workers (1989) infected colostrum-deprived piglets aged 6 
to 7 days with the UK isolate of PRCV. The animals remained clinically healthy 
and developed specific antibodies 1 week later. Van Nieuwstadt and Pol (1989), 
however, used a Dutch isolate of the virus to infect specific pathogen-free pigs 
aged 5 weeks and recorded a severe illness in all infected animals after 1 to 2 
days. The disease was characterized by anorexia, lethargy, labored respiration, 
and fever. Coughing was observed after 8 days and the condition improved from 
10 days on, although two of the animals died of pneumonia 4 and 7 days after 
inoculation. Vannier (1990) noted a transient hyperthermia, mild to severe 
dyspnea, polypnea, and an obvious cessation of growth in colostrum-derived 
piglets infected by the intratracheal route. 

A clinical disease, with fever, growth retardation, and lung lesions, was 
induced by PRCV in 6- to 8-week-old specific pathogen-free pigs and this was 
not exacerbated by concurrent infection with swine influenza viruses (Lanza 
et al., 1992). 


C. Pathology 


In the UK study, O'Toole and his co-workers (1989) examined the pathology 
produced by PRCV and compared it with that produced by TGEV. They found 
that both viruses caused a mild cranioventral bronchointerstitial pneumonia 
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involving between 5 and 20?6 of the lung parenchyma. Both viruses were shown 
to replicate in the nonciliated cuboidal bronchiolar cells, and these cells then 
bulged into the lumen and subsequently detached. There was an accumulation 
of fibrin, macrophages, and cellular debris in the bronchioles, alveolar ducts, 
and alveoli. A minority of the infected piglets showed other pathology, includ- 
ing mild laryngitis, tracheitis, and rhinitis. While TGEV was identified in 
association with intestinal pathology, PRCV produced no intestinal lesions, 
although low levels of virus were isolated from the small intestine, suggesting 
limited replication. 

The pigs that died in the study by Van Nieuwstadt and Pol (1989) were 
shown to have a catarrhal lobular disseminated bronchopneumonia involving 
all lobes of the lung. An American strain of PRCV, isolated from the intestine of 
a pig from a herd with endemic TGE, was capable of causing 60% consolidation 
of the lung 10 days after oronasal infection (Halbur et al., 1993). There was a 
necrotizing and proliferative bronchointerstitial pneumonia characterized by 
necrosis, squamous metaplasia, dysplasia, proliferation of airway epithelium, 
mononuclear cell infiltration of alveolar septa, mild type II pneumocyte prolif- 
eration, and lymphohistiocytic alveolar exudation. The lesions were not associ- 
ated with clinical disease and were resolved by 15 days after infection. 

In addition to PRCV replication in the respiratory tract, Pensaert's group 
demonstrated that viremia occurred and some virus reached the gastrointesti- 
nal tract (Cox et al., 1990a). There was limited replication of the virus in the 
jejunum, spreading to the ileum and the duodenum, in a few cells located in the 
subepithelial layers of the villi and crypts. Following direct inoculation into the 
lumen of the gut of 1-week-old piglets, PRCV replication was similarly limited 
to these sites and virus could be isolated from feces for several days (Cox et al., 
1990b}. 


D. Immune Response 


Infection of pigs with PRCV results in the production of serum antibodies 
that react with PRCV and with TGEV in virus neutralization, immune fluores- 
cence, and enzyme-linked immunosorbent assay (ELISA) tests. It is likely that 
these antibodies confer some degree of protection on the pigs, although the 
mildness of the disease and the widespread occurrence of the virus make this 
difficult to assess. The antigenic relatedness of PRCV and TGEV suggested the 
possibility that seroconversion to PRCV would protect pigs against subsequent 
infection with TGEV. Protection against subsequent challenge with TGEV was 
seen in 10-week-old pigs that had been infected with PRCV by aerosol (Cox et 
al., 1993). However, pig herds that are known to be seropositive to PRCV have 
become infected with TGEV, and no protection has been afforded to nursing 
piglets in an experimental study (Paton and Brown, 1990). Whether this absence 
of cross-protection is due to critical, yet minor, antigenic differences between 
the two viruses or to the absence of lactogenic immunity following PRCV 
infection of sows has yet to be determined, although a degree of protective 
lactogenic immunity was claimed by Bernard et al. (1989). 
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IV. PORCINE EPIDEMIC DIARRHEA VIRUS 
A. History of the Disease 


A TGE-like disease was observed in young and old pigs in England at the 
end of 1976 and was termed "epidemic diarrhea type II" (Wood, 1977). The 
affected animals did not seroconvert to TGEV, however, and TGEV antigen 
could not be detected in the infected intestinal epithelium. Examination of gut 
contents from affected pigs in England (Chasey and Cartwright, 1978) and 
Belgium (Pensaert and Debouck, 1978) showed particles with characteristic 
coronavirus morphology, and the agent was recognized as a distinct coronavirus 
and named porcine epidemic diarrhea virus (PEDV). The disease was subse- 
quently identified in other parts of Europe, including Czechoslovakia, Hungary, 
and Germany (Pensaert, 1981). Attempts to characterize the virus were hin- 
dered by its inability to replicate in cell culture until Hofmann and Wyler (1988) 
showed that the use of trypsin overcomes this block. The agent was subse- 
quently shown to have physicochemical characteristics that confirmed its 
classification as a coronavirus (Hofmann and Wyler, 1989; Kusanagi et al 1992) 


B. Clinical Signs 


The disease resembles TGE very closely and it is difficult to distinguish 
between them in the field. Mortality is generally restricted to piglets aged 1 
week or less, but clinical signs and morbidity are frequently seen in the older 
animal, with diarrhea, depression, and anorexia. Experimental infection of pigs 
with an isolate of the PEDV from Germany, V215/78 (Witte et al 1981), showed 
that a proportion of animals aged 12 weeks or older displayed severe symptoms 
of depression and lassitude. 


C. Pathology 


The replication of PEDV closely resembles that of TGEV, with viral anti- 
gens being detected in the epithelial cells of the villi in the small intestine 
shortly after infection, followed by desquamation and villous stunting. Unlike 
TGE, however, PEDV antigens have been detected in the epithelium of the 
colon. There have been no reports to date of replication in the lung. 

Macroscopic lesions appear to be limited to the small intestine. The damage 
to the villi results in impaired intestinal function, as with TGE, but the extent 
of intestinal degeneration is reported to be less than that seen with TGEV. 


D. Immune Response 


Serum of pigs recovering from infection with PEDV contain antibodies to 
the virus detectable by immune fluorescence and virus neutralization tests. 
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These antibodies do not cross-react with TGEV, and further evidence for the 
lack of serological relationship between the two viruses was provided by the 
absence of cross-protection in pigs that had recovered from infection with one 
virus and were subsequently infected with the other (Debouck and Pensaert, 
1980). It is very likely that protection of suckling piglets against PEDV would be 
based on lactogenic immunity and, as discussed for TGEV, suitable vaccines 
have yet to be developed. 


V. HEMAGGLUTINATING ENCEPHALOMYELITIS VIRUS 
A. History of the Disease 


A disease of neonatal pigs was described in Canada in 1958 in which the 
clinical signs were vomiting and wasting (Roe and Alexander, 1958). A similar 
disease, reported in Ontario the following year, developed into an acute enceph- 
alomyelitis (Alexander et al., 1959), and subsequent studies showed that the 
disease could be transmitted. The causal agent was not identified until 1962, 
however, when a virus was isolated in cell culture from piglets with encephalo- 
myelitis (Greig et al., 1962). The virus produced syncytia in monolayer cultures 
of infected pig kidney cells, and the supernatant fluid from these cultures could 
hemagglutinate chick erythrocytes, prompting the name hemagglutinating en- 
cephalomyelitis virus (HEV). 

Several years later in England, a virus was isolated in cell culture from the 
brain of a piglet exhibiting the vomiting and wasting disease described earlier in 
Canada (Cartwright et al., 1969). This virus also had the ability to produce 
syncytia and hemagglutinate red cells, and a serological relationship with the 
Canadian HEV isolate was shown, indicating that the two viruses were closely 
related if not identical. The virus was recognized from its morphology to be a 
member of the coronavirus group in 1971, and was subsequently identified in 
association with diseased pigs from many parts of the world (reviewed by Greig, 
1981]. 


B. Clinical Signs 


The two diseases induced by HEV, vomiting and wasting or encepha- 
lomyelitis, both become established in piglets in the first few days of their lives 
but are quite distinct clinically. The vomiting and wasting form starts with 
anorexia, depression, and inappetence. This is followed by retching and, in 
some piglets, vomiting of undigested milk (Lai et al., 1992). The ability to drink 
appears to be impaired and the piglets rapidly lose weight. There is no evidence 
of diarrhea, although constipation has been recorded. Three to four days after 
the first clinical signs are seen, infected piglets may have distended abdomens. 
Affected animals may die within 1 to 2 weeks but frequently survive for several 
weeks before dying of starvation or secondary infection. 
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The encephalomyelitic form of the disease often starts with clinical signs 
similar to those described for the vomiting and wasting form, but the coat 
becomes staring, the extremities cyanotic, and signs of CNS involvement are 
soon manifest. These include some respiratory distress, hyperesthesia, and 
slight paralysis of the hind legs. In advanced stages of the disease the piglets 
have trouble breathing, become blind, and progress into terminal coma. The 
clinical signs develop over a period of about 10 days, but occasionally pigs 
survive and recover to show no aftereffects. 


C. Pathology 


Infection occurs via the oronasal route, and the primary site of virus repli- 
cation appears to be the respiratory tract and the pharyngeal tonsils. There is 
little or no replication of the virus in the alimentary tract, and the vomiting 
produced is not related to a gastrointestinal infection as seen with TGEV and 
PEDV. The elegant studies of Andries and Pensaert (1980a,b) clearly demon- 
strated the spread of the virus by way of the peripheral nervous system to the 
CNS. Viral antigens were identified in the early stages of infection in the 
trigeminal ganglion, the inferior vagal ganglion, the superior cervical ganglion, 
the intestinal nervous plexuses, the solar ganglion, and the dorsal root ganglia 
of the lower thoracic region. Infection of the brain stem was shown to occur first 
in the medulla oblongata and then spread to other parts of the brain, including 
the cerebrum and cerebellum in later stages of infection. Virus was detected in 
the nervous plexus of the stomach after the incubation period, but it was not 
clear whether vomiting resulted from damage to these nerve plexuses or was 
controlled by CNS. 


D. Immune Response 


Circulating antibodies to HEV are found in serum from infected animals 
and these have been used to diagnose the disease by hemagglutination inhibi- 
tion and virus neutralization tests. The epidemiology of HEV-induced disease 
suggests that herd immunity plays a part in limiting the spread of the virus 
(Appel et al., 1965). These authors conducted experimental infections that 
indicated that piglets are protected by maternal antibodies present in colos- 
trum. 

In a serological survey of pig herds in Belgium, Pensaert and co-workers 
(1980) found that piglets suckled by immune dams acquired neutralizing anti- 
bodies to HEV that persisted for 11 to 12 weeks. This passive immunity was 
replaced by active immunity between 8 to 16 weeks as the piglets became 
infected by virus circulating within the herd, but there were no clinical signs 
associated with this seroconversion, probably because the piglets were pro- 
tected by the maternal antibodies. 
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VI. CONCLUSIONS 


It is not obvious why the pig should provide such a suitable host for 
coronaviruses. Intensive rearing focuses attention on the health of a species and 
provides an environment that allows rapid spread of a newly emerged virus and 
this may provide a partial explanation. The factors that determine host species 
range are not clear for most virus groups, although the presence of closely 
related coronaviruses in pigs (TGEV], dogs (canine enteric coronavirus), and 
cats (feline infectious peritonitis) suggests that coronaviruses may move readily 
between species. HEV is antigenically related to murine, bovine, and human 
coronaviruses and may have evolved from one of these. The evolution of PRCV 
is readily explained by mutation of TGEV which is well established in pig herds 
in all parts of the world, but the emergence of PEDV as a pig pathogen is more 
difficult to explain. It is most likely that it arose from a coronavirus infecting a 
species of wild life in close contact with pigs. If this is so, then the conclusion 
may be drawn that other coronaviruses remain to be isolated from species that 
have not yet been studied in depth, thereby providing a reservoir from which 
new diseases of commercially important animals may arise. 
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CHAPTER 18 


Human Coronavirus Infections 


STEVEN H. MyYINT 


I. INTRODUCTION AND HISTORY 


The first report of a human coronavirus was in 1965 when Tyrrell and Bynoe 
(1965) isolated a virus from the nasal washings of a male child. The child had 
typical symptoms and signs of a common cold and the washing was found to be 
able to induce common colds in volunteers challenged intranasally. The virus, 
termed B814 (after the number of the nasal washing), could be cultivated in 
human embryo tracheal organ tissue but not in cell lines used at that time for 
growing other known etiologic agents of the common cold. At the same time, 
Hamre and Procknow (1966) were characterizing five ‘‘new” agents isolated 
from the respiratory tract of medical students with colds. One of these agents, 
strain 229E, was adapted to grow in WI-38 cells. Subsequently, Almeida and 
Tyrrell (1967) showed that these isolates were morphologically identical to the 
viruses of avian bronchitis and mouse hepatitis. McIntosh and colleagues 
(1967a), working at the National Institutes of Health in Bethesda, then isolated 
six morphologically related viruses that could not be adapted to cell monolayer 
culture but would grow in organ cultures. Two of these isolates, OC (for organ 
culture) 38 and 43 were then adapted to grow in suckling mice brain. The term 
“coronavirus,” which described the characteristic morphology of these agents, 
was accepted in 1968 (Tyrrell et al., 1968a). 


II. HUMAN RESPIRATORY CORONAVIRUSES 


There are a number of human respiratory coronaviruses described in the 
literature, but few have been well characterized (Table IJ. On the basis of 
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TABLE I. Classification of 
Human Coronaviruses 


Serogroup Prototype ` Virus 


A 229E 229E 


B OC43 OC A3 


Unclassified B814 


serological cross-reactivity, however, it is possible to classify most of them 
(Reed, 1984; Bradburne, 1970; McIntosh et al., 1969). The two main serogroups 
are 229E-related and OC43-related and it is these prototype viruses that will be 
discussed in the rest of this chapter. Coronaviruslike particles have been seen in 
the stools of humans, but, as they have not been characterized, they will not be 
discussed in this chapter. 


III. EPIDEMIOLOGY 


Most epidemiological surveys of these viruses as agents of respiratory tract 
illness have been based on serology, using either complement fixation or 
hemagglutination-inhibition tests for the two prototype viruses 229E and 
OC43. As it is clear that serologically unrelated coronaviruses exist, the preva- 
lence of these viruses is likely to be underestimated. Moreover, it is also certain 
that not all human respiratory coronaviruses have yet been adapted to tissue or 
organ culture. This conclusion is supported by a study based in London, Eng- 
land, in which coronaviruses were isolated, by tissue or organ culture, from 
18.4% of patients with common colds. However, a further 13% of the isolates 
were able to induce colds in inoculated volunteers, although they could not be 
identified as any of the known "common cold” viruses (Larson et al., 1980). 

The most extensive epidemiological survey of OC43 infection has been the 
study of the community of Tecumseh, Michigan (Monto and Lim, 1974). A 
4-year study involved looking for serological evidence of infection with OC43 
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in 910 persons in 269 families. A mean of 17.1% of individuals showed evidence 
of infection with OC43 in any one year. There was, however, a cycling of the 
frequency of occurrence: OC43 infections occurred in most years but there were 
peaks of infection every 3 years. Over 80% of infections occurred despite 
preexisting antibody with infections gradually diminishing with age. The peaks 
of infection took place in the winter-spring months. A study of 229E infections 
in the same community showed a mean annual rate of 7.7% infected persons 
(Cavallaro and Monto, 1970). This figure was half of that recorded in a 6-year 
study of Chicago medical students, in which there was also marked year-to-year 
variation with peaks of 35% incidence (Hamre and Beem, 1972). Nearly all 229E 
infections also occurred in the winter and spring months. The Tecumseh study 
showed that either 229E or OC43 was dominant in any one year. 

Studies of coronaviruses as causes of clinical illness have shown that 
coronaviruses are second only to rhinoviruses as the causes of the common 
cold. In the United States and England, 229E and OC43 are responsible for 1 to 
30% of all clinical cases (McIntosh et al., 1970a,b; Isaacs er al 1983; Wenzel 
et al., 1974; Kaye et al., 1971; Owen-Hendley et al., 1972; Bradburne et al 1967), 
with approximately an equal number of subclinical infections. 


IV. DISEASE MANIFESTATIONS 


Human respiratory coronaviruses are now well accepted as causes of upper 
respiratory tract illness and, in particular, the common cold. The viruses have 
been isolated from patients with the common cold and they produce common 
colds when inoculated into volunteers intranasally (Andrewes, 1962). There is a 
mean incubation period of 3 days (range 2-5 days) followed by an illness that 
lasts a mean of 6-7 days (range 2-18 days). The classical clinical illness is well 
known to all of us and consists of general malaise, headache, nasal discharge, 
sneezing, and a mild sore throat (Tyrrell et al., 1993). Approximately one tenth 
of patients will also have a fever and one fifth will have a cough. Table II 
contrasts the clinical features of rhinovirus type 2-induced common colds, 
those induced by coronavirus 229E, and the illness caused by the influenza A 
virus (compiled from Lowenstein and Parrino, 1987; McIntosh et al., 1973, 
1974). It is not possible on an individual basis to distinguish rhinovirus colds 
from coronavirus colds. Although earlier studies suggested that there may be 
differences in the clinical symptomatology with different coronaviruses, this 
has not been substantiated. 

Less well documented are lower respiratory tract infections associated 
with coronaviruses. A seroepidemiological study, using a complement fixation 
test, found that coronaviruses were less likely to be found in hospitalized 
children with lower respiratory tract infection than in controls with non- 
respiratory tract disease. From 565 children with lower respiratory tract infec- 
tion there was evidence of infection with OC38 or OC43 in 3.596, compared to 
8.2% of 245 children in the group with nonrespiratory tract disease (McIntosh 
et al., 1970a,b). In a later study, however, there was serological evidence of 
either 229E or OC43 infection in 8.2% of 417 hospitalized children under 18 
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TABLE II. Clinical Features of Rhinovirus, Coronavirus, 
and Influenza A Respiratory Tract Infection 


Coronavirus 229E Rhinovirus 2 Influenza A 


Clinical feature (96) (96) (96) 
Fever 9-23 7-16 98 
Nasal discharge and/or obstruction 94-100 ` 64-100 20-30 
Headache 32-85 28-50 85 
General malaise 46-47 28-43 80 
Sneezing 85 50 30 
Sore throat 54-68 87-93 50-60 
Cough 21-31 64-68 90 
Hoarseness 12 57 10 
Myalgia 9 21 60-75 
Watery/sore eyes 29 43 60-70 
Chills 18 21 90 


months of age with lower respiratory tract disease (McIntosh et al., 1974). The 
incidence of coronavirus infection was higher than that of other respiratory 
viruses, except parainfluenza virus type 3 and respiratory syncytial virus. It is 
unlikely that there is a direct infection of the lower respiratory tract with 
coronaviruses and any association is most probably through secondary phe- 
nomena. Though these secondary phenomena are yet to be defined, a link 
between coronavirus infection of the upper respiratory tract and wheezing 
attacks has been shown in several studies over the last 20 years (Isaacs et al., 
1983; McIntosh et al., 1973; S. L. Johnston et al., unpublished data). At particu- 
lar risk are asthmatic children, in which up to 30% of acute wheezing episodes 
may be due to coronavirus infection. Increased airways resistance has been 
shown to occur in the upper respiratory tract in nonatopic individuals with 
colds [Bende et al 1989; Akerlund, 1993) and asthmatic individuals are likely to 
be at greater risk. 

Nonrespiratory tract illnesses have also been associated with corona- 
viruses and include multiple sclerosis, pancreatitis, thyroiditis, pericarditis, 
nephropathy, and infectious mononucleosis (Riski and Hovi, 1980; Apostolov 
and Spasic, 1975; Arnold et al., 1981]. The association with multiple sclerosis 
has been of particular interest since coronaviruslike particles were seen in the 
postmortem brain of a patient who died with the disease (Tanaka et al., 1976). 
Subsequently, OC43-related coronaviruses were isolated from the brain mate- 
rial of two multiple sclerosis patients (Burks et al 1980), although it is likely 
that these isolates were murine coronaviruses present in the mice used for 
cultivation (Weiss, 1983; Gerdes et a1., 1981; Fleming et al., 1988). Seroepidemio- 
logical studies trying to ascertain an association of coronaviruses with multiple 
sclerosis have been conflicting (Madden et al 1981; Salmi et al., 1982; Hovanec 
and Flanagan, 1983; Leinikki et al., 1981). Gene detection has failed to detect 
OC43 (Sorensen et al., 1986) but has suggested a neurotropism for 229E (Stewart 
et al., 1992). It is clear that further research in this area is needed. 

The role of coronaviruses in causing diseases outside the respiratory tract 
has been doubted, in part, because there has been no evidence that the virus can 


HUMAN CORONAVIRUS INFECTIONS 393 


spread from the nasal mucosa. Using the polymerase chain reaction, however, it 
has recently been possible to show that there is a short viremic phase of 229E, in 
at least some experimentally inoculated volunteers (S. Myint, unpublished 
data]. The identification of the 229E receptor as a metalloprotease that is found 
on the surface of cells in many tissue types would also suggest a possible 
involvement of human coronaviruses in diseases other than just the common 


cold. 


V. PATHOGENESIS AND IMMUNE RESPONSE 


Little is known about the detailed pathogenic mechanisms in human corona- 
virus infection, principally because there is no animal model of infection. Even 
the predominant mode of transmission is uncertain. Infection can be induced 
experimentally by direct inoculation of virus into the nose, but this is unlikely 
to be the natural route. By analogy with rhinoviruses it is likely that infection is 
either by aerosols or fomites (Editorial, 1988). In support of this view is the 
finding that 229E survives well in an atmosphere of high humidity and low 
temperature (Ijaz et al., 1985). 

Once in the nose, the virus is thought to enter the cell via a specific 
receptor, aminopeptidase N. Replication is optimal at 32-33 °C, the tempera- 
ture in the superficial layers of the nasal mucosa. This results in sloughing of 
the superficial nasal epithelium and a proteinaceous exudate. Serum antibody 
levels rise after about a week, but it is not clear whether it is this response or 
cell-mediated mechanisms that clear the infection (Callow et al., 1990). Cer- 
tainly, there is some correlation of the severity and likelihood of disease with 
preexisting serum antibody, but the mere presence of such antibody is not 
protective (Callow, 1985). Serum antibody levels peak about 2 weeks after 
infection and decline to low or undetectable levels at 12-18 months. Reinfec- 
tions are common. 


VI. DIAGNOSIS 


Because of the trivial and temporary nature of common colds, the detection 
of coronavirus infections has not been attempted in routine diagnostic laborato- 
ries. This situation is unlikely to change unless antiviral therapy becomes 
available. Although most techniques available to the diagnostic virologist have 
been used to detect human respiratory coronaviruses, even in research laborato- 
ries the range of tests employed by any one center tends to be limited. This 
chapter will give a synopsis of the range of methods used, but the reader is 
referred. elsewhere for details of methods (Myint and Tyrrell, 1994). 


A. Organ Cultures 


The method, or modifications of it, developed by Tyrrell and Bynoe (1965) 
to isolate B814 is still used by some laboratories but is hampered by the diffi- 


394 STEVEN H. MYINT 


culty of obtaining human embryonic tracheal tissue. It is the best method 
available for primary isolation of the broadest range of respiratory corona- 
viruses. - | 

Tracheal tissue is taken from 14- to 24-week embryos and planted in sterile 
plastic Petri dishes containing 199 medium. The tissue is immersed with cilia 
uppermost. Virus can be inoculated onto the cilia and then incubated at 33 °C 
for up to 10 days. Viral replication is indicated by cessation of ciliary activity 
and confirmed by interference with another virus (echovirus, parainfluenza, or 
Sendai]. Electron microscopy was used originally to confirm the isolates as 
coronavirus but other tests such as virus neutralization are now employed. 

Trachea obtained from 5- to 9-month-old fetuses have also been shown to 
support the growth of coronaviruses, and different media recipes can also be 
used (McIntosh et al., 1967a,b). 


B. Mouse Brain Culture 


OC38 and OC43 (but not B814, OC16, OC37, OC44, or OC48) have been 
adapted to grow in suckling mice brains (McIntosh et al., 1967a,b, 1970; Tyrrell 
et al., 1968b). The mice can be inoculated intracerebrally or via the peritoneum, 
with encephalitis occurring in some several days later. In the initial description 
of McIntosh and colleagues, CD-1 Swiss mice were used, with encephalitis 
occurring 11-15 days after inoculation with virus that had been passaged several 
times in organ culture. After the fourth passage, the time to illness was reduced 
to 40-60 hr. Evidence of infection was not found in other organs (e.g., liver, 
heart, or lungs). Virus can be prepared from brain suspensions by clarification 
through low-speed centrifugation and then adsorption to and elution from 
group O erythrocytes. Virus may be visualized by electron microscopy. Al- 
though this is not used for primary isolation of virus, this method is still a 
commonly used means of preparing OC43 antigen for serological assays. 


C. Cell Culture 


Cell cultures have proved to be unreliable for the primary isolation of all 
human respiratory coronaviruses, but certain strains have been adapted to 
growth in them. The 229E and related strains grow well in a continuous hetero- 
poid cell line termed C16, as they were the 16th clone of MRC-C cells that was 
selected (Philpotts, 1983). In the original description of these cells from the 
MRC Common Cold Unit in Salisbury, the morphology showed a mixture of 
fibroblastic and epithelioid cells. The former constituted three quarters of the 
cell population. These cells were contaminated with organisms detected by 
Hoechst stain 33258, presumably Mycoplasma spp. Thus, C16 cells would 
arguably be the cell line of choice for the isolation of 229E, but frequent passage 
of these cells results in a increasing proportion of the epithelioid content and a 
consequent reduction in the ability of the cell line to sustain replication of 
virus. 
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Many laboratories have continued to utilize a cell line that was originally 
used in the work of Hamre and Procknow (1966): a human diploid cell strain 
from Wistar Institute, so-called WI-38 cells. Although 229E was readily adapted 
to this cell line, primary isolation was in human kidney cells, and the authors 
noted that WI-38 cells may not be ideal for primary isolation. 

Apart from C16 and Wi-38 cells, many other cell lines have been used to 
grow individual virus strains (Hamre and Procknow, 1966; Reed, 1984; Larson et 
al., 1980; Hamre et al., 1967; Kapikian et al., 1969; Bradburne, 1969, 1972; 
Schmidt et al., 1979; Schmidt and Kenny, 1982; Tyrrell et al., 1979; Chaloner- 
Larsson and Johnson-Lussenberg, 1981; Bruckova et al., 1970). These are sum- 
marized in Table III. These viruses do not grow well in cell types commonly 
used for the isolation of other respiratory viruses such as HEp-2 or Rhesus 
monkey kidney, which makes routine identification unlikely. It is, moreover, 
clear that the ideal cell line for isolation and propagation is not yet available and 
other methods of diagnosis have to be applied. 


D. Electron Microscopy and Immune Electron Microscopy 


Electron microscopy of nasal washings is impractical as the virus load is 
usually below the level of sensitivity of standard methods. It has been used, 
however, to detect virus in tissue sections such as in mouse brain culture of 
OC43. It is also the means by which human enteric coronaviruses (HECVs) 
have been detected. Negative staining with tungsten has usually been the 
method of choice (Almeida and Tyrrell, 1967; Tyrrell and Bynoe, 1965), but 
molybdenum and uranium salts have also been used. The size of the particles 
appear greater if uranium salts are used in place of tungsten (Davies and Mac- 
Naughton, 1979). 


TABLE III. Cell Lines and Strains Used for the Cultivation of Human Coronaviruses 


Cell type Virus Primary isolation or adaptation 


Human embryonic kidney 229E Primary 
C16 (see text) 229E Adaptation = primary 
Human embryonic lung fibroblast, WI-38 229E Adaptation = primary 


Human embryonic lung fibroblast, MRC-c — 229E and OC43 Adaptation > primary 
(OC43, adaptation only] 
Human embryonic lung epithelium, L132 229E and OC43 Adaptation > primary 


Human embryonic intestinal fibroblast, 229E Primary isolation of some 
MA177 strains 

Human type II pneumocytes 229E Primary > adaptation 

Human fetal tonsil fibroblast 229E and OC43 Adaptation 

Human embryonic rhabdomyosarcoma 229E and OC43 Adaptation 

Primary monkey kidney OC43 Adaptation 

Rhesus monkey kidney epithelium, OC43 Adaptation 
LLC-MK2 

Continuous green monkey kidney OC43 Adaptation 


epithelioid, BSC-1 
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An attempt to enhance electron microscopy by utilizing antibody concen- 
tration of cultured virus has been used successfully to detect 692, virus in 
washings from an adult with upper respiratory tract infection (Kapikian et al., 
1973). Nasal washings were passaged through both cell culture and tracheal 
organ culture, and the resulting supernatant was then incubated with convales- 
cent serum from the same patient. After centrifugation the pellet was then 
examined on a Formvar-carbon-coated grid. Aggregates of virus were clearly 
discernible. Supernatant that had been incubated with phosphate-buffered sa- 
line, instead of convalescent serum, was also examined but virus particles were 
not seen. 


E. Immunofluorescence 


An immunofluorescence method has been developed and applied to the 
detection of 229E and OC43 in nasopharyngeal secretions and washings (McIn- 
tosh et al., 1967a). Sera were raised in rabbits against mouse brain-derived OC43 
and cell culture-grown 229E and used in an indirect fluorescence assay. This 
test was able to detect homologous coronavirus antigens in nasal washings 
from infected volunteers, though cross-reactivity was noted with the 229E 
antiserum in washings from volunteers who had been inoculated with OC43 
and OC44. No nasopharyngeal aspirates from 106 children who were hospital- 
ized with respiratory tract infection had detectable coronavirus antigen by this 
method. It is difficult to ascertain whether this was due to a lack of sensitivity, 
as paired sera collected from 66 children during the study period did not show 
evidence of coronavirus infection. 


F. Enzyme-Linked Immunoassay 


An enzyme-linked immunosorbent assay (ELISA) method based on purified 
229E and HECV CV-Paris (which has cross-reactivity with OC43) has been used 
to diagnose infections in children (Isaacs et al., 1983; MacNaughton et al., 1983). 
The ELISA method is a modification of that described for antibody detection 
using rabbit antisera (see Section VLI]. In a study of 30 children aged 6 months 
to 6 years, 159 samples were collected: 111 nose swabs, 11 throat swabs, and 55 
nasopharyngeal aspirates. Of these, 34.2% of the nose swabs, but only 18.2% of 
the throat swabs and nasopharyngeal aspirates, were positive for either 229E or 
OC43. No comparison with serology was attempted, but the positivity rate 
would suggest that this ELISA was a sensitive test. 


G. Nucleic Acid Hybridization 


The first application of gene detection methods to detecting human corona- 
viruses was developed for 229E using Northern hybridization (Myint et al., 
1990). A cDNA that encoded the entire nucleocapsid gene for 229E was ligated 
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into a Riboprobe (Promega) vector, pGEM-1, from which 22P-labeled full-length 
transcripts could be generated. These transcripts could be made as sense or 
antisense, depending on whether an SP6 or T7 promoter as used. This method 
has been applied to the detection of 229E in nasal washings from inoculated 
volunteers (Myint et al 1989) and has been shown to be at least as sensitive as 
culture. There is also the advantage of a diagnostic result being available within 
48 hr. An interesting observation was that the probe method was able to detect 
virus for longer than cell culture in sequential samples from the volunteers. 
This probe method will not detect OC43 and attempts to remove the radioactive- 
labeling by incorporating biotin or digoxigein into transcripts have led to signif- 
icant loss of sensitivity (S. Myint, unpublished data]. 


H. Reverse Transcription-Polymerase Chain Reaction 


With the advent of gene and probe amplification strategies, it was to be 
expected that these methods would be seen as advantageous for viruses that are 
difficult to cultivate. Gene amplification methods based on "nested" priming 
have been shown to be a sensitive and specific means of detecting both 229E and 
OC43 (Myint et al., 1994). Serotype-specific nested primers were designed from 
the known sequences of the nucleocapsid genes of 229E and OC43. The inner 
primers were, in particular, chosen to produce a small fragment of about 100 
base pairs for maximum sensitivity. RNA is extracted using an acid-phenol/ 
guanidinium isothiocyanate procedure followed by reverse transcription using 
murine Moloney leukemia virus reverse transcription. Two 20-cycle amplifica- 
tion steps are then used with the outer and inner sets of primers, respectively. 
The sensitivity of the assay appears to be much greater than that of cell culture 
or probe methods and each primer pair appears to be either 229E or OC43 
specific. The use of this method has greatly enhanced the diagnostic yield of 
coronaviruses in clinical material from asthmatic children (S. L. Johnston et al., 
unpublished data). The assay is, at least, as sensitive as a combination of culture 
and serology for diagnosing infection and is more specific. Reverse transcription- 
polymerase chain reaction is likely to become the method of choice for direct 
virus detection. 


I. Serological Methods 


Because of the lack of reliable detection methods prior to the development 
of those based on gene detection, most epidemiological studies have used 
serological assays to determine evidence of coronavirus infection. The most 
widely used and sensitive format is the enzyme-linked immunoassay. The 
assay was first described for strain 229E (Kraaijeveld et al., 1980; MacNaughton, 
1982) but has since been adapted for detection of antibodies to OC43 (Schmidt, 
1984). The 229E assay uses antigen that is grown in cell monolayers and then 
clarified. The OC43 test uses mouse-brain-derived antigen. Rabbit antisera 
have been used for both 229E and OC43 tests. The specificities of the assays are 
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similar to that of counterimmunoelectrophoresis, neutralization, and comple- 
ment fixation assays but sensitivity is over 1000-fold greater. In volunteer 
studies, the 229E assay has shown a close correlation between clinical illness 
and virus shedding. It has been the principal method for determining the occur- 
rence and frequency of coronaviruses infections in serological surveys, but 
recent data suggest that some false-positive and false-negative reactions occur 
(Myint et al., 1994). The 229E assay also detects antibody rises to some 229E- 
like viruses (PR, KI, and TO]. 

Other serological test formats have also been used: indirect hemagglutina- 
tion (Kaye et al 1972) and immune-adherence hemagglutination (Gerna et al., 
1978) for 229E antibody; rapid microneutralization (Gerna et al., 1979) and 
plague-reduction (Gerna et al., 1980) for OC43 antibody; and immunofluores- 
cence (Monto and Rhodes, 1977), complement fixation (Hovi, 1978), and single 
radial hemolysis (Hierholzer and Tannock, 1977; Riski et al., 1977) for both 
229E and OC43 antibody. These test formats have been superseded by the 
ELISA test. 
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CHAPTER 19 


The Pathogenesis of 
Torovirus Infections in Animals 
and Humans 


MARION KOOPMANS AND MARIAN C. HORZINEK 


I. INTRODUCTION 


In 1992, the new genus torovirus was added to the family Coronaviridae 
(Pringle, 1992; Cavanagh and Horzinek, 1993), ending a period of controversy 
about the assignment of this taxonomic cluster. Toroviruses are enveloped, 
positive-stranded RNA viruses that may cause enteric infections in animals 
and humans. The first descriptions mentioned superficial morphological re- 
semblances with coronaviruses, both being 80-120 nm, enveloped, peplomer- 
bearing particles with a pleomorphic appearance as seen by electron micros- 
copy (EM) (Weiss et al 1983; Woode et al., 1982). However, further studies 
revealed morphological and antigenic differences, sparking discussions about 
the place toroviruses should occupy in taxonomy (Horzinek et al 1984, 1985, 
1986, 1987; Horzinek and Weiss, 1984; Koopmans et al., 1986; Weiss et al., 1984; 
Weiss and Horzinek, 1986, 1987; Woode et al., 1982, 1985). The matter was 
settled when studies of the replication mechanism and genomic sequence 
showed fundamental similarities between toro- and coronaviruses (reviewed by 
Snijder and Horzinek, 1993; Bredenbeek et al., 1990; den Boon et al., 1991; 
Snijder et al., 1988, 1989, 1990 a-c; see also Chapter 11, this volume]. 

The torovirus prototype Berne virus (BEV) was isolated in Berne, Switzer- 
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land from a rectal swab taken from a horse with severe gastroenteritis 1 week 
before it died (Weiss et al., 1983). Salmonella lille, isolated from the same swab, 
was considered to be the cause of the disease in this animal, and BEV has 
remained a virus in search of a disease. However, since it grows in cell culture 
and has therefore been extensively studied by the Utrecht group, BEV has been 
designated the torovirus prototype. 

The situation is quite different for another torovirus that had been de- 
scribed 1 year before; it had been found by EM in feces from calves in a dairy herd 
in Breda, Iowa, that had severe diarrhea (Woode et al., 1982). Breda virus (BRV) 
does not replicate in cell or tissue culture, but has been identified as a pathogen 
causing gastroenteritis in calves and possibly in older cattle (Koopmans et al., 
1990, 1991c; Saif et al 1981; Woode et al 1982, 1985). In the years following the 
discoveries of BEV and BRV, serological evidence of torovirus infection has been 
obtained in all ungulates that were tested using a BEV neutralization test 
(horses, cattle, sheep, goats, pigs], and in rats, rabbits, and some species of feral 
mice (Weiss et al., 1984). Also, toroviruslike particles have been detected in 
stool specimens from pigs (Penrith and Gerdes, 1992; Durham et al., 1989; Scott 
et al., 1987; L. Saif, personal communication), humans (Beards et al., 1984, 1986; 
Brown et al., 1987; Koopmans et al., 1991a, 1993b), cats (Muir et al., 1990], and 
dogs (Hill and Yang, 1984). There is little doubt that solitary torovirions have 
been seen by electron microscopists, but their pleomorphism precluded their 
identification as viruses since confirmatory testing was unavailable (Koopmans 
et al., 1991a, 1993b; Liebler et al., 1992). No antibodies have been found in the 
sera of cats and humans (Brown et al 1988; Weiss et al 1984). 

The purpose of this chapter is to highlight epidemiological and clinical 
studies with an emphasis on the pathogenesis of toroviruses. We will focus on 
bovine toroviruses since most information on the infection in vivo has been 
obtained from studies in cattle. For an update on the structural and morphologi- 
cal properties of the virus, on its replication strategy, and on diagnosis of 
torovirus infections in animals and humans, the reader is referred to other 
recent reviews (Koopmans and Horzinek, 1994; Snijder and Horzinek, 1993) and 
to Chapter 11, this volume. 


I. INFECTION IN CATTLE 
A. Enteric Infections 


. Three strains of BRV have been used to study the course of infection in its 
natural host. Besides the original "isolate," two additional strains of bovine 
enteric toroviruses have been reported; the three strains were assigned to two 
groups (BRV1 and BRV2) based on antigenic comparisons: BRV1 refers to the first 
Iowa strain, and BRV2 comprises the two strains that had been detected in feces 
from a 5-month-old diarrheal calf in Ohio and from a 2-day-old calf in Iowa (Saif 
et al., 1981; Woode et al., 1983, 1985). All three strains are pathogenic for 
newborn gnotobiotic and nonimmune conventional calves, aged 1 hour to 10 
weeks (Woode et al., 1985). 
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Although the respiratory system may be sporadically involved (see Section 
II.C), BRV infections are usually limited to the gut. Between 1 and 3 days after 
oral inoculation, calves typically develop a watery diarrhea that persists for 4 to 
5 days (Woode et al 1982, 1983, 1985). The most severe symptoms occur within 
2 days after onset of the diarrhea, with dehydration, weakness, and depression 
(Woode, 1987). In the presence of a normal intestinal flora (but in the absence of 
maternal antibodies), diarrhea generally is more severe than in gnotobiotic 
calves. Both crypt and villus epithelial cells are infected from the midjejunum 
through to the large intestine (Woode et al 1982; Woode, 1987). 

Infections with BRV seem to be ubiquitous, as evidence of infection has 
been obtained in every country where serological and/or virological studies 
were done: Belgium, Great Britain, France, Germany, India, Italy, The Nether- 
lands, Switzerland, and the United States (Vanopdenbosch et al., 1992a; Brown 
et al., 1987, 1988; Lamouliatte et al 1987; Liebler et al 1992; Koopmans et al., 
1989; Weiss et al., 1984; Woode et al 1985). In addition, unconfirmed torovirus- 
like particles have been found in cattle from South Africa (Penrith and Gerdes, 
1992) and New Zealand (Horner, personal communication]. 

The infections are quite common in dairy cattle; by 1 year of age, 85-95% of 
the animals have antibodies to BRV (Koopmans et al., 1989; Weiss et al., 1984; 
Woode et al., 1985). In a study among dairy cattle in The Netherlands, most 
seroconversions took place between 6 and 12, months of age, after the waning of 
maternal immunity (Koopmans et al., 1986). 

The presence of maternal antibodies in calves (in 9596 of the animals) did 
not prevent infection, but may have modified its outcome, since diarrhea 
generally was mild in BRV-excreting calves (Koopmans et al., 1990, 1991c}. BRV 
infections accounted for 496 of cases of diarrhea in calves in this study. BRV- 
associated diarrhea under farm conditions was clinically indistinguishable 
from that caused by the most common enteropathogens (rota- or coronavirus), 
although it lasted slightly longer (average 9.2, 6.8, and 6.8 days, respectively, for 
the three viruses) and affected slightly older calves (average 12.7, 7.7, and 8.3 
days, respectively) (Koopmans et al., 1991c}. Calves 3-4 months of age showed 
very mild diarrhea cr no diarrhea at all in association with torovirus shedding 
(Koopmans et al., 1991c). 

Besides its association with gastroenteritis in young calves, BRV is a pos- 
sible cause of diarrhea in adult dairy cows (Koopmans et al., 1991c). 


B. Pathogenesis of Enteric Torovirus Infections 


The exact pathogenic pathways that typify torovirus infections in cattle are 
not known. However, results from experimental infections of calves suggest a 
mechanism similar to that of bovine coronaviruses {BCV} (Clark, 1993). Calves 
may be infected with BRV by the oral route and possibly by the respiratory route 
(Woode et al., 1982, 1985; Vanopdenbosch et al., 1991). The epithelial cells lining 
the small and large intestine become infected, with progression from areas of 
the midjejunum down through the ileum and colon (Fagerland et al., 1986). 
Within the small intestine, not only epithelial cells at the top of intestinal villi 
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are infected but also cells in the upper third of the crypts (Fagerland et al., 1986; 
Pohlenz et al., 1984). This has important consequences for the outcome of the 
infection as can be deduced from the physiology of the gastrointestinal tract: in 
an uninfected gut there is a well-attuned balance between the sequestration of 
cells at the top of the intestinal villi and the production of new enterocytes in 
the crypts. Crypt epithelial cells rapidly divide and are pushed up toward the 
top of the intestinal villus. During this process the new cells mature into 
absorbing cells by the development of fingerlike (brush border) projections that 
increase the cell surface many times and by the production of brush border- 
associated enzymes. Following a toroviral infection, the turnover rate of epithe- 
lial cells increases because infected cells die and detach (Fagerland et al 1986; 
Pohlenz et al., 1984}, leading to an efficient spread of the infection. In fecal 
preparations from experimentally BRV infected calves, hemagglutination (HA 
titers up to 3 x 10’ units/ml have been measured (Woode et al 1983], corre- 
sponding to virus titers of 1011 to 1012 (Zanoni et al 1986). With such high titers 
the infection will spread rapidly. The lost epithelium is replaced by immature 
cells that do not yet possess a mature brush border, thereby presenting insuffi- 
cient absorptive surface and digestive enzymes (15-65% reduction in p-xylose 
resorption in BRV-infected calves) (Woode et al., 1982, 1985). Since BRV infects 
both crypt cells and the villus epithelium, restoration of the normal structure 
and function may be slower than after infections that affect only the villous 
epithelium (e.g., by rotaviruses and BCV). A longer duration of diarrhea can 
therefore be expected (Koopmans et al., 1991c) with more severe complications. 
The decrease in digestive and absorptive capacities leads to accumulation of 
lactose in the gut lumen, which in turn results in water and electrolyte reten- 
tion leading to diarrhea and occasionally to dehydration, acidosis, hypoglyce- 
mia, and death. 


C. Respiratory Infection 


Recently, Vanopdenbosch et al. (1992a,b) reported the isolation of a toro- 
virus from the respiratory tract of calves with pneumonia [bovine respiratory 
torovirus (BRTV}]. If confirmed, this finding would suggest that bovine toro- 
viruses are both entero- and pneumotropic, as has been clearly demonstrated for 
BCV (Reynolds, 1983; Reynolds et al., 1985; Saif et al., 1986). Most likely 
different strains are involved, because respiratory tract symptoms were rarely 
observed in calves after experimental BRV infection that suffered from diarrhea, 
whereas in the calves described by Vanopdenbosch et al. (1992a,b) respiratory 
symptoms were predominant (Koopmans et al., 1990; Woode et al., 1982, 1985]. 
Also, when testing RNA extracts from cells infected with the BRTV isolate in a 
diagnostic polymerase chain reaction assay with BEV/BRV consensus primers, 
no detectable amplification product was obtained (M. Koopmans, unpublished 
results). The "respiratory" isolate is the only bovine torovirus that has been 
adapted to cell culture, but its presence still needs to be confirmed. Using 
infected cells in an immunofluorescence test, Vanopdenbosch et al. (1992b] 
found high levels of antibody in all calves (n = 50) that persisted for 5 months. 
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These authors also detected antibodies in 20 batches of fetal bovine serum, in 
contrast with earlier findings (Koopmans et al., 1990; Woode et al., 1982). 


D. Infection of Other Organ Systems 


The unexpected finding of antibodies in all commercially available batches 
of fetal bovine serum and in 7 of 13 precolostral calf sera (Vanopdenbosch et al., 
1992b} indicated the possibility of transplacental torovirus infections in cattle; 
this concept was underscored by the demonstration of toroviral antigen in 
placental cotyledons of spontaneously aborted calves using an immunofluores- 
cence assay. The same authors suggested a possible role in central nervous 
disturbances, sudden death, and in a syndrome resembling mucosal disease 
(Vanopdenbosch et al., 1992a). 


E. BRV Infection and the Immune System 


In addition to crypt and villus epithelial cells of the small and large intes- 
tine, BRV also infects the dome epithelium overlying Peyer’s patches. The dome 
epithelial cells, including the M cells, show the same cytopathic changes that 
occur in the absorptive villous cells (Woode et al., 1984; Pohlenz et al 1984). In 
addition, the germinal centers in Peyer’s patches of BRV-infected calves were 
found depleted of lymphocytes and occasionally showed fresh hemorrhage 
(Woode et al., 1982). The M cells are specialized gut epithelial cells that endo- 
cytose macromolecules and microorganisms from the gut lumen and present 
them to underlying lymphoid tissue; they play a key role in the development of 
an immune response against enteric pathogens. Direct or IgA-mediated adher- 
ence to M cells has been observed for other viruses that infect them (bovine 
astrovirus) or use their transepithelial transport capacity to cross the epithe- 
lium and to invade neural and lymphoid tissues (reovirus and poliovirus) (Pear- 
son et al., 1978; Sicinski et al., 1990; Weltzin et al., 1989; Wolf et al., 1981). 
Infection of M cells by BRV and bovine astrovirus results in a cytopathic effect, 
while poliovirus and reovirus infections are noncytopathic. It is unknown 
whether BRY (or astroviruses) pass the epithelial lining via the M cell pathway; 
no such evidence was obtained in clinical and pathological studies (Fagerland et 
al., 1986; Woode et al., 1982, 1985). 

The tropism of BRV for dome cells may have consequences for the (mu- 
cosal) immune response. Development of BRV-specific IgM antibodies inter- 
estingly occurred very late after primary infection of sentinel calves (< 1 month 
of age), and no memory response was seen in them after a second BRV infection 
at 10 months of age (Koopmans et al., 1990). An alternative explanation for these 
findings may be the suppressive effect of maternal antibodies on infection: In 
experimental BCV infections the immune response was delayed in calves that 
had been fed colostrum with high titers of specific antibody compared with the 
response in calves fed low-titered colostrum (Heckert et al., 1991). 
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F. Chronic Infection 


Chronic torovirus infections also may occur, as data from a recent epidemi- 
ological study suggest. In a closed herd of 10 dairy calves repeated torovirus 
shedding was observed in several animals, with intervals of several weeks to 4 
months. When the calves were introduced into a herd of adult cows, they all had 
an episode of diarrhea in association with BRV shedding, followed by serocon- 
version. The adult cows showed no evidence of active BRV infection in the 2 
weeks preceding the arrival of the calves, but they all had high levels of preexist- 
ing antibodies (Koopmans et al., 1990). These observations indicate the pres- 
ence of carriers that shed low levels of virus or undergo recurrent subclinical 
infections. Virus persistence and shedding may be an important source of virus 
in epizootics of neonatal torovirus infections. Infections with BCV may serve as 
a model for understanding the situation in bovine torovirus infections, given 
the close resemblance in other aspects of the pathogenesis of these viruses. 
Evidence of coronavirus shedding has been found in up to 75% of clinically 
normal cows (Collins et al., 1987; Crouch and Acres, 1984), but only 5% were 
shedding free virus that could be detected in regular enzyme-linked immuno- 
sorbent assay (ELISA); the remaining 70% were identified using an ELISA for 
the detection of immune complexes (Crouch and Acres, 1984). In a follow-up 
study, free virus could be detected intermittently in some animals, but immune- 
complexed virus was present in the feces throughout the 12 weeks of the study 
(Crouch et al., 1985). Chronic shedders can be a source of infection as shown by 
Bulgin et al. (1989): calves from carrier cows had a 60% chance of developing 
clinical illness, whereas those from noncarriers had only a 22% chance. 

With the development of highly sensitive, polymerase chain reaction- 
based detection methods for toroviruses, the role of carriers in the epidemiology 
and pathogenesis of torovirus infections can now be addressed (Koopmans et al., 
19932]. 


III. INFECTION IN HORSES 
A. A Virus in Search of a Disease 


Similar to the situation for bovine toroviruses, infection of horses is quite 
common in the populations that have been examined; 8196 of the adult horses 
in Switzerland and The Netherlands possess neutralizing antibodies to BEV 
(Weiss et al., 1983, 1984), 3596 in Germany, and 38% in India (Liebermann, 
1990). 

Equine toroviruses may cause similar disease pictures as BRV in calves, but 
epidemiological studies in populations most likely at risk have not been done so 
far. Diarrhea in young foals is a big problem, and epidemiological studies should 
be aimed at this age group («1 month of age). Infection experiments with BEV 
have been very limited: two yearlings that had been injected intravenously with 
10’ TCID,, of tissue culture grown BEV seroconverted without clinical symp- 
toms (Weiss et al., 1984). Virus shedding was not monitored in these horses. A 
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3-day-old gnotobiotic foal was inoculated orally, and again no symptoms were 
seen, although virus shedding and seroconversion occurred. Attempts to infect 
a 3-month-old foal were unsuccessful (Dr. F. Scott, Moredun Research Institute, 
Edinburgh, England, personal communication, 1992). Tissue culture-passaged 
virus was used in these experiments, which may be of low virulence as a result 
of the adaptation. 


B. The Torovirus Mutant BEV 


Irrespective of many attempts (Weiss, unpublished observations], isolation 
of BEV has been a unique event and could only be repeated with the field sample 
from the same horse. This observation suggests that the Berne isolate is a 
mutant. 

Most parts of the BEV genome have been sequenced; it would be interesting 
to obtain sequence information from pathogenic toroviruses and to look for 
explanations for the difference in pathogenicity. Of special interest is a pre- 
sumed pseudogene [open reading frame 4 (ORF 4]] that has been identified in the 
BEV genome. The predicted amino acid sequence of the ORF 4 product bears 
similarities to the C-terminal part of the hemagglutinin esterase (HE) of BCV 
and of human influenza C virus; however, the 5' two-thirds of the HE gene are 
missing, and an ORF 4 product has not been identified in BEV virions or in 
lysates of infected cells (Snijder et al., 1991). The BCV HE (like the influenza C 
virus HE] uses N-acetyl-9-O-acetylneuraminic acid as a receptor to initiate the 
infection of cultured cells and probably plays a role in infection of host cells in 
vivo (Clark, 1993; Schultze and Herrler, 1992). Recently, 2 kb of the 3'-end of 
BRV has been sequenced; it was demonstrated that BRV has a complete HE gene 
that is probably functional (L. A. H. M. Cornelissen and R. J. de Groot, unpub- 
lished observation; Koopmans et al., 1986). 


C. Host Range 


Infection with BEV in vitro is limited to cells of equine origin. Neutralizing 
antibodies have been found in sera from cattle, goats, sheep, pigs, rabbits, and 
feral mice, indicating a close antigenic relationship between toroviruses of 
these species or cross-species infections with one or a few related toroviruses. 
The latter option is not likely. Mouse immune sera raised against BRV2 show 
very little cross-reactivity with BRV, except at the level of the peplomers; the 
sera recognized the polypeptides of the homologous virus and the two highest- 
molecular-weight proteins (105 kDa and 85 kDa) of BRV1 in radioimmune 
precipitation. The same sera inhibited hemagglutination of the heterologous 
serotype and efficiently neutralized the infectivity of BEV (Koopmans et al., 
1986]. 

No evidence of viral replication was obtained in rats, mice, or lambs that 
had been experimentally infected with BRV (Woode et al., 1982; Woode, 1987). 
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In contrast, the respiratory bovine toroviruses reportedly replicate in cells from 
a wide range of host species (Vanopdenbosch et al., 19925]. 


IV. TOROVIRUS INFECTIONS IN OTHER SPECTES 


Toroviruslike particles and torovirus antibodies have been found in species 
other than the cattle and horse, indicating that these viruses may infect a broad 
range of animal hosts (Muir et al., 1990; Scott et al., 1987; Weiss et al., 1984). 
Although in most cases stool specimens from animals with diarrhea were 
examined, the pathogenic role of these toroviruses remains unclear, and epide- 
miological studies are needed to study the causal relationship between virus 
presence and disease (Durham et al., 1989; Hill and Yang, 1984; Muir et al., 1990; 
Scott et al., 1987). 

The toroviruslike particles found in humans (Beards et al. 1984, 1986} 
cross-react antigenically with BRV (Koopmans et al., 1993b]; preliminary data 
from an ongoing epidemiological study in Brazil indicate an association with 
diarrhea (Koopmans and Guerrant, submitted for publication). 
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